[Title of Document] DESCRIPTION

[Title Of Invention] A NUCLEIC ACID CONSTRUCT CONTAINING A NUCLEIC
ACID DERIVED FROM THE GENOME OF HEPATITIS C VIRUS (HCV) OF
GENOTYPE 2a, AND A CELL HAVING SUCH NUCLEIC ACID CONSTRUCT
INTRODUCED THEREIN
[CLAIMS] 
[Claim 1]

A replicon RNA, comprising a nucleotide sequence containing at least the 5' 
untranslated region, the sequence encoding NS3 protein, NS4A protein, NS4B 
protein, NS5A protein and NS5B protein and the 3' untranslated region on the
genomic RNA of hepatitis C virus of genotype 2a.
[Claim 2] 

The replicon RNA of claim 1, further containing at least one selection marker gene
or a reporter gene, and at least one IRES sequence. 
[Claim 3] 

A replicon RNA, comprising a nucleotide sequence containing the 5' untranslated 
region comprising the nucleotide sequence represented by SEQ ID NO: 9 or 10; at 
least one selection marker gene or a reporter gene; an IRES sequence; the 
sequence encoding NS3 protein, NS4A protein, NS4B protein, NS5A protein and 
NS5B protein on the genomic RNA of hepatitis C virus of genotype 2a; and the 3' 
untranslated region comprising the nucleotide sequence represented by SEQ ID 
NO: 11 or 12. 
[Claim 4] 

The replicon RNA of any one of claims 1 to 3, wherein the genomic RNA of
hepatitis C virus of genotype 2a is an RNA comprising the nucleotide sequence 
represented by SEQ ID NO: 3 or 5. 
[Claim 5] 

A replicon RNA, comprising the following RNA (a) or (b):

(a) an RNA comprising the nucleotide sequence represented by SEQ ID NO: 1 or
2; and

(b) an RNA comprising a nucleotide sequence derived from the nucleotide 
sequence represented by SEQ ID NO: 1 or 2 by deletion, substitution or addition 
of 1 to 10 nucleotides, and being capable of autonomous replication. 

[Claim 6] 

A replicon-replicating cell, which is prepared by introducing the replicon RNA of
any one of claims 1 to 5 into a cell.
[Claim 7] 

The replicon-replicating cell of claim 6, wherein the cell is a eukaryotic cell. 

[Claim 8] 

The replicon-replicating cell of claim 7, wherein the eukaryotic cell is a human 
liver-derived cell. 
[Claim 9] 

The replicon-replicating cell of claim 7, wherein the eukaryotic cell is an Huh7
cell.
[Claim 10] 

The replicon RNA of any one of claims 1 to 5, which is for producing or 
evaluating a therapeutic agent or a diagnostic agent for treatment of hepatitis C
virus infection.
[Claim 11] 

The replicon-replicating cell of any one of claims 6 to 9, which is for producing or 
evaluating a therapeutic agent or a diagnostic agent for treatment of hepatitis C 
virus infection. 
[Claim 12] 

The replicon RNA of any one of claims 1 to 5, which is for producing a vaccine 
against hepatitis C virus infection. 
[Claim 13] 

The replicon-replicating cell of any one of claims 6 to 9, which is for producing a 
vaccine against hepatitis C virus infection.
[Claim 14]

A method of producing a replicon RNA of hepatitis C virus of genotype 2a,
comprising extracting the replicon RNA from the replicon-replicating cell of any
one of claims 6 to 9.
[Claim 15] 

A method of producing a viral protein of hepatitis C virus of genotype 2a, 
comprising culturing the replicon-replicating cell of any one of claims 6 to 9, and 
obtaining the viral protein from the resulting culture product.
[Claim 16] 

A method of screening for a substance promoting or suppressing the replication of 
hepatitis C virus, comprising culturing the replicon-replicating cell of any one of 
claims 6 to 9 in the presence of a test substance, and detecting the replication of a 
replicon RNA in the resulting culture product.
[Claim 17] 

A method of introducing a mutation that increases the replication efficiency to the 
replicon RNA of hepatitis C virus of genotype 2a, comprising performing once or 
more the following: obtaining a replicated replicon RNA from the replicon- 
replicating cell of any one of claims 6 to 9, and introducing the thus obtained 
replicated replicon RNA into a cell that is different from the replicon-replicating 
cell so as to prepare a new replicon-replicating cell. 
[Claim 18] 

The method of claim 17, wherein the replication efficiency increases to become at 
least two times greater than that of the replicon RNA that is introduced at the 
beginning into the replicon-replicating cell. 

[Claim 19] 

A method of producing a replicon RNA of hepatitis C virus of genotype 2a having 
increased replication efficiency, comprising performing once or more the 
following: obtaining a replicated replicon RNA from the replicon-replicating cell 
of any one of claims 6 to 9, and introducing the thus obtained replicated replicon
RNA into a cell that is different from the replicon-replicating cell so as to prepare
a new replicon-replicating cell; and obtaining a replicated replicon RNA from the 
finally obtained replicon-replicating cell. 

[Claim 20] 

A method of producing a replicon RNA of hepatitis C virus of genotype 2a having 
increased replication efficiency, comprising detecting a nucleotide mutation or an 
amino acid mutation between the replicon RNA that is produced so as to have an 
increased replication efficiency by the method of claim 19 and the replicon RNA 
that is introduced at the beginning into the replicon-replicating cell; and 
introducing the thus detected nucleotide mutation or amino acid mutation into a 
replicon RNA whose replication efficiency is to be increased. 
[Claim 21] 

A replicon RNA, comprising a nucleotide sequence derived from the nucleotide 
sequence represented by SEQ ID NO: 1 by at least one mutation selected from the 
group consisting of the following (a) to (h):

(a) a mutation from A to G at nucleotide site 7157; 

(b) a mutation from C to U at nucleotide site 4955; 

(c) a mutation from A to G at nucleotide site 4936; 

(d) a mutation from A to G at nucleotide site 5000; 

(e) a mutation from A to G at nucleotide site 7288; 

(f) a mutation from G to U at nucleotide site 5901;

(g) a mutation from A to U at nucleotide site 6113; and

(h) a mutation from A to G at nucleotide site 2890. 
[Detailed Description of Invention] 

[0001] 

[Technical Field of Invention] 

The present invention relates to a replicon RNA of the hepatitis C virus of 
genotype 2a, a replicon-replicating cell wherein the replicon RNA is introduced, 
and a method of increasing the replication efficiency of the replicon RNA.
[0002]

[Conventional Art] 

The hepatitis C virus (HCV) is a virus belonging to the family
Flaviviridae. It has a single-stranded (+) strand sense RNA as its genome and is
known to cause hepatitis C. Recent studies have revealed that hepatitis C virus
is classified into a number of types based on genotypes or serotypes. According
to the phylogenetic analysis of Simmonds et al. using the nucleotide sequences of
HCV strains, which is currently the mainstream method of classifying HCV
genotypes, HCV is classified into 6 genotypes: genotype 1a, genotype 1b,
genotype 2a, genotype 2b, genotype 3a and genotype 3b (see Non Patent Literature
1). Each of these types is further classified into several subtypes. The
nucleotide sequences of the full-length genomes of several genotypes
of HCV have been determined to date (see Patent Literature 1 and Non Patent
Literatures 2-5).
[0003] 

HCV causes chronic hepatitis by persistent infection. Currently, the main 
cause of chronic hepatitis observed worldwide is persistent HCV infection. 
Actually, around 50% of individuals with persistent infection develop chronic 
hepatitis. Chronic hepatitis in approximately 20% of these patients shifts to liver 
cirrhosis over the course of 10 to 20 years, and some of these patients further go 
on to advanced lethal pathological conditions such as hepatic cancer. 
[0004] 

Hepatitis C is currently treated mainly by a therapy using interferon-α or
interferon-β, or a combination therapy using interferon-α and ribavirin, a
purine-nucleoside derivative. However, even when these therapies are
performed, therapeutic effects are observed in only approximately 60% of all
treated patients. When the therapies are stopped after they have taken
effect, the disease recurs in more than half of the patients. The therapeutic
effect of interferons is known to depend on the HCV genotype, and is said to be lower
against genotype 1b and higher against genotype 2a (see Non Patent Literature 6).
[0005] 

It is an important goal to develop therapeutic or prophylactic agents
effective against hepatitis C, the incidence of which is high in industrialized
countries, for which no causal treatment currently exists, and which ultimately
leads to serious outcomes. Hence, the development of HCV-specific
chemotherapies and vaccine therapies is earnestly desired. A target for the
development of an anti-HCV agent may be the suppression of HCV replication or 
the suppression of infection of cells with HCV. 
[0006] 

Until recently, propagating HCV in a cell culture system and infecting
cultured cells with HCV have been difficult. Moreover, the chimpanzee has been
the only animal that can be infected with HCV and used in experiments, so
that it has been difficult to study the replication mechanism of HCV
and the infection mechanism of HCV. Recently, however, HCV subgenomic RNA
replicons have been prepared as HCV-derived autonomously replicable RNA (see
Patent Literature 2 and Non Patent Literatures 7-10), which enables the analysis of
the replication mechanism of HCV using cultured cells. These HCV subgenomic
RNA replicons are each prepared by replacing the structural protein region located
downstream of the HCV IRES in the 5' untranslated region of the HCV genomic RNA
of genotype 1b with a neomycin resistance gene and an EMCV IRES
ligated downstream of the resistance gene. It has been demonstrated that this
RNA replicon replicates autonomously in Huh7 cells, a human hepatic cancer
cell line, when introduced into the Huh7 cells and cultured in the presence of
neomycin.
[0007]

However, regarding such intracellular RNA replication systems for HCV,
only those using HCV genomic RNA of genotype 1b have been prepared so far.
Since it has been reported that different genotypes of HCV also differ in the viral
proteins they encode, it may be difficult to sufficiently elucidate the replication
mechanism of HCV only by analyzing the subgenomic RNA replicons derived
from HCV of genotype 1b. Furthermore, given that the therapeutic
effects of interferons differ depending on the HCV genotype, it may be
particularly difficult to develop an anti-HCV agent effective against various
types of HCV using only an HCV replication system containing the
subgenomic RNA replicon of HCV of genotype 1b.
[0008] 

[Patent Literature 1] JP Patent Publication (Kokai) No. 2002-171978 A 

[Patent Literature 2] JP Patent Publication (Kokai) No. 2001-17187 A 

[Non Patent Literature 1] Simmonds, P. et al., Hepatology, (1994) 10, pp. 1321-1324
[Non Patent Literature 2] Choo et al., Science, (1989) 244, pp. 359-362
[Non Patent Literature 3] Kato et al., J. Med. Virol., (2001) 64(3), pp. 334-339
[Non Patent Literature 4] Okamoto, H. et al., J. Gen. Virol., (1992) 73, pp. 673-679
[Non Patent Literature 5] Mori, S. et al., Biochem. Biophys. Res. Commun., (1992) 183, pp. 334-342
[Non Patent Literature 6] Yoshioka et al., Hepatology, (1992) 16(2), pp. 293-299
[Non Patent Literature 7] Lohmann et al., Science, (1999) 285, pp. 110-113
[Non Patent Literature 8] Blight et al., Science, (2000) 290, pp. 1972-1974
[Non Patent Literature 9] Friebe et al., J. Virol., (2001) 75(24), pp. 12047-12057
[Non Patent Literature 10] Ikeda et al., J. Virol., (2002) 76(6), pp. 2997-3006

[0009] 

[Problem to be Solved by Invention] 

An object of the present invention is to provide an HCV-derived replicon
RNA of an HCV genotype for which replicon RNA has not yet been prepared.
[0010]

[Means for Solving the Problem]

As a result of intensive studies to achieve the above object, we have
succeeded in preparing the replicon RNA of HCV genotype 2a.
[0011] 

That is, the present invention is as follows. 
[1] A replicon RNA, comprising a nucleotide sequence containing at least the 5'
untranslated region, the sequence encoding NS3 protein, NS4A protein, NS4B 
protein, NS5A protein and NS5B protein and the 3' untranslated region on the 
genomic RNA of hepatitis C virus of genotype 2a. Preferably, this replicon RNA 
further contains at least one selection marker gene or a reporter gene, and at least 
one IRES sequence. 

[2] A replicon RNA, comprising a nucleotide sequence containing the 5' 
untranslated region comprising the sequence represented by either SEQ ID NO: 9 
or 10; at least one selection marker gene or a reporter gene; an IRES sequence; the 
sequence encoding NS3 protein, NS4A protein, NS4B protein, NS5A protein and 
NS5B protein on the genomic RNA of hepatitis C virus of genotype 2a; and the 3' 
untranslated region comprising the nucleotide sequence represented by either SEQ 
ID NO: 11 or 12. 

[3] The replicon RNA of [1] or [2] above, wherein the genomic RNA of hepatitis C 
virus of genotype 2a is an RNA comprising the nucleotide sequence represented by 
SEQ ID NO: 3 or 5.

[4] A replicon RNA, comprising the following RNA (a) or (b): 

(a) an RNA comprising the nucleotide sequence represented by SEQ ID NO: 1 or 
2; and 

(b) an RNA comprising a nucleotide sequence derived from the nucleotide 
sequence represented by SEQ ID NO: 1 or 2 by deletion, substitution or addition 
of 1 to 10 nucleotides, and being capable of autonomous replication.

[5] A replicon-replicating cell, which is prepared by introducing the replicon RNA
of any one of [1] to [4] above into a cell. For this replicon-replicating cell, the cell
into which the replicon RNA is introduced is preferably a eukaryotic cell, more
preferably a human liver-derived cell, and still more preferably an Huh7 cell.

[6] The replicon RNA of [1] to [4] above, which is for producing or evaluating a
therapeutic agent or a diagnostic agent for treatment of hepatitis C virus infection. 

[7] The replicon-replicating cell of [5] above, which is for producing or evaluating 
a therapeutic agent or a diagnostic agent for treatment of hepatitis C virus 
infection.

[8] The replicon RNA of [1] to [4] above, which is for producing a vaccine against 
hepatitis C virus infection. 

[9] The replicon-replicating cell of [5] above, which is for producing a vaccine 
against hepatitis C virus infection. 

[10] A method of producing a replicon RNA of hepatitis C virus of genotype 2a, 
comprising extracting the replicon RNA from the replicon-replicating cell of [5] 
above.

[11] A method of producing a viral protein of hepatitis C virus of genotype 2a, 
comprising culturing the replicon-replicating cell of [5] above, and obtaining the 
viral protein from the resulting culture product. 

[12] A method of screening for a substance promoting or suppressing the
replication of hepatitis C virus, comprising culturing the replicon-replicating cell
of [5] above in the presence of a test substance, and detecting the replication of a
replicon RNA in the resulting culture product.

[13] A method of introducing a mutation that increases the replication efficiency 
to the replicon RNA of hepatitis C virus of genotype 2a, comprising performing 
once or more the following: obtaining a replicated replicon RNA from the 
replicon-replicating cell of [5] above, and introducing the thus obtained replicated
replicon RNA into a cell that is different from the replicon-replicating cell so as to
prepare a new replicon-replicating cell. In this method, it is preferred that
the replication efficiency increases to at least two times that of the replicon
RNA initially introduced into the replicon-replicating cell.

[14] A method of producing a replicon RNA of hepatitis C virus of genotype 2a
having increased replication efficiency, comprising performing once or more the
following: obtaining a replicated replicon RNA from the replicon-replicating cell
of [5] above, and introducing the thus obtained replicated replicon RNA into a cell
that is different from the replicon-replicating cell so as to prepare a new
replicon-replicating cell; and obtaining a replicated replicon RNA from the finally
obtained replicon-replicating cell.

[15] A method of producing a replicon RNA of hepatitis C virus of genotype 2a
having increased replication efficiency, comprising detecting a nucleotide
mutation or an amino acid mutation between the replicon RNA that is produced so
as to have an increased replication efficiency by the method of [14] above and the
replicon RNA that is introduced at the beginning into the replicon-replicating cell;
and introducing the thus detected nucleotide mutation or amino acid mutation into
a replicon RNA whose replication efficiency is to be increased.

[16] A replicon RNA, comprising a nucleotide sequence derived from the
nucleotide sequence represented by SEQ ID NO: 1 by at least one mutation
selected from the group consisting of the following (a) to (h):

(a) a mutation from A to G at nucleotide site 7157; 

(b) a mutation from C to U at nucleotide site 4955; 

(c) a mutation from A to G at nucleotide site 4936; 

(d) a mutation from A to G at nucleotide site 5000; 

(e) a mutation from A to G at nucleotide site 7288; 

(f) a mutation from G to U at nucleotide site 5901; 

(g) a mutation from A to U at nucleotide site 6113; and 

(h) a mutation from A to G at nucleotide site 2890. 
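
As an illustration only, the substitutions (a) to (h) listed above can be expressed as data and applied to a parent sequence in silico. The following is a minimal sketch, assuming 1-based nucleotide sites; the names `PointMutation`, `LISTED_MUTATIONS` and `apply_mutations` are hypothetical and are not part of this specification, and the demonstration uses a short placeholder sequence rather than SEQ ID NO: 1.

```python
from typing import Iterable, NamedTuple


class PointMutation(NamedTuple):
    site: int  # 1-based nucleotide site, as in the list above
    ref: str   # nucleotide expected in the parent replicon RNA
    alt: str   # nucleotide after the mutation


# Mutations (a) to (h) as listed above for SEQ ID NO: 1.
LISTED_MUTATIONS = {
    "a": PointMutation(7157, "A", "G"),
    "b": PointMutation(4955, "C", "U"),
    "c": PointMutation(4936, "A", "G"),
    "d": PointMutation(5000, "A", "G"),
    "e": PointMutation(7288, "A", "G"),
    "f": PointMutation(5901, "G", "U"),
    "g": PointMutation(6113, "A", "U"),
    "h": PointMutation(2890, "A", "G"),
}


def apply_mutations(rna: str, mutations: Iterable[PointMutation]) -> str:
    """Return a copy of `rna` with each substitution applied.

    Raises ValueError if a site lies outside the sequence or if the parent
    nucleotide differs from the expected reference nucleotide.
    """
    bases = list(rna.upper())
    for m in mutations:
        if not 1 <= m.site <= len(bases):
            raise ValueError(f"site {m.site} outside sequence of length {len(bases)}")
        found = bases[m.site - 1]
        if found != m.ref:
            raise ValueError(f"expected {m.ref} at site {m.site}, found {found}")
        bases[m.site - 1] = m.alt
    return "".join(bases)


# Toy demonstration on a short placeholder sequence (NOT SEQ ID NO: 1).
toy_parent = "GACCUAGA"
toy_mutant = apply_mutations(
    toy_parent, [PointMutation(2, "A", "G"), PointMutation(4, "C", "U")]
)
print(toy_mutant)  # GGCUUAGA
```

Checking the reference nucleotide before substituting guards against applying a site number to a sequence with different numbering.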
[0012] 

[Mode for Carrying out Invention] 

The present invention is explained in detail as follows. 
1. HCV-derived replicon RNA according to the present invention

The genome of hepatitis C virus (HCV) is a single-stranded (+) strand
RNA comprising approximately 9600 nucleotides. This genomic RNA comprises
the 5' untranslated region (also denoted as 5' NTR or 5' UTR), a translated region
composed of a structural region and a non-structural region, and the 3' untranslated
region (also denoted as 3' NTR or 3' UTR). HCV structural proteins are encoded
in the structural region, and a plurality of non-structural proteins are encoded in
the non-structural region.
[0013] 

Such HCV structural proteins and non-structural proteins are generated
as follows: the translated region is translated into a single continuous polyprotein,
the polyprotein undergoes limited cleavage by proteases, and
the structural proteins (Core, E1 and E2) and non-structural proteins
(NS2, NS3, NS4A, NS4B, NS5A and NS5B) are released. Among these
structural proteins and non-structural proteins, that is, the viral proteins of HCV, Core
is a core protein, E1 and E2 are envelope proteins, and the non-structural proteins
(NS2, NS3, NS4A, NS4B, NS5A and NS5B) are proteins involved in the virus's own
replication. NS2 is known to have metalloprotease activity, and NS3 is known to
have serine protease activity (in the N-terminal one-third) and helicase
activity (in the C-terminal two-thirds). Furthermore, NS4A is a cofactor
for the protease activity of NS3, and NS5B has been reported to have RNA-dependent
RNA polymerase activity. The genome of HCV of genotype 2a has
already been reported to have a similar gene structure (see Patent Literature 1).
[0014] 

We have constructed RNA capable of autonomous replication using such 
HCV genome of genotype 2a. Specifically, the HCV-derived replicon RNA of the 
present invention is an RNA construct, which contains the whole or partial RNA of 
the HCV genome of genotype 2a and is capable of autonomous replication. 

[0015] 

In this specification, RNA that is prepared by altering the viral genome of 
HCV and is capable of autonomous replication is referred to as "replicon RNA" or
"RNA replicon." RNA that is artificially prepared from HCV of genotype 2a and
is capable of autonomous replication is referred to as "replicon RNA derived from
HCV of genotype 2a." In this specification, the HCV-derived replicon RNA is
also referred to as an HCV-RNA replicon.
[0016] 

In the present invention, "hepatitis C virus of genotype 2a" or "HCV of 
genotype 2a" means hepatitis C virus identified as genotype 2a according to the 
international classification of Simmonds et al. The "hepatitis C virus of genotype
2a" or the "HCV of genotype 2a" of the present invention encompasses not only a 
virus having naturally occurring HCV genomic RNA, but also a virus having 
genomic RNA prepared by artificially altering a naturally occurring HCV genomic 
sequence. Specific examples of HCV of genotype 2a include viruses of the JFH-1
strain and the JCH-1 strain (see Patent Literature 1).
[0017] 

Furthermore, "the genomic RNA of hepatitis C virus of genotype 2a" 
means RNA that comprises the single-stranded (+) strand sense RNA of hepatitis C 
virus of genotype 2a and has the nucleotide sequence throughout the entire region 
of its genome. The genomic RNA of hepatitis C virus of genotype 2a is 
preferably RNA comprising the nucleotide sequence represented by SEQ ID NO: 3
or 5, but is not limited thereto. 
[0018] 

In the specification of the present application, "5' untranslated region"
(5' NTR or 5' UTR), "a sequence encoding NS3 protein, NS4A protein, NS4B
protein, NS5A protein and NS5B protein," "a sequence encoding Core protein"
(Core region or C region), "a sequence encoding E1 protein" (E1 region), "a
sequence encoding E2 protein" (E2 region), "a sequence encoding NS2 protein"
(NS2 region), "a sequence encoding NS3 protein" (NS3 region), "a sequence
encoding NS4A protein" (NS4A region), "a sequence encoding NS4B protein"
(NS4B region), "a sequence encoding NS5A protein" (NS5A region), "a sequence
encoding NS5B protein" (NS5B region) and "3' untranslated region" (3' NTR or 3'
UTR), and other specific regions or sites are determined based on the nucleotide
sequence of SEQ ID NO: 3 of the full-length cDNA (JFH-1 clone) encoding the
entire region of the genome of the JFH-1 strain, which is HCV of genotype 2a.
The nucleotide sequence of SEQ ID NO: 3 can be obtained from the International
DNA Data Bank (DDBJ/EMBL/GenBank) by referring to accession No.
AB047639. Specifically, when a particular HCV RNA sequence is aligned with
the nucleotide sequence represented by SEQ ID NO: 3, the sequence that aligns
with nucleotides 1 to 340 of SEQ ID NO: 3 is the "5' untranslated region" of the
RNA, the sequence that aligns with nucleotides 3431 to 9442 is "a sequence
encoding NS3 protein, NS4A protein, NS4B protein, NS5A protein and NS5B
protein," the sequence that aligns with nucleotides 3431 to 5323 is "a sequence
encoding NS3 protein," the sequence that aligns with nucleotides 5324 to 5485
is "a sequence encoding NS4A protein," the sequence that aligns with
nucleotides 5486 to 6268 is "a sequence encoding NS4B protein," the
sequence that aligns with nucleotides 6269 to 7666 is "a
sequence encoding NS5A protein," the sequence that aligns with nucleotides
7667 to 9442 is "a sequence encoding NS5B protein," and the sequence
that aligns with nucleotides 9443 to 9678 is the "3' untranslated
region." In this case, gaps, additions, deletions, substitutions or the
like may be present in the "aligned" sequences. Furthermore, the above
"particular HCV" includes, but is not limited to, the JFH-1 strain, the JCH-1
strain, and viral strains that are derivatives thereof.
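
For orientation, the region boundaries stated above for SEQ ID NO: 3 can be tabulated and checked for consistency. The following is a minimal sketch, assuming 1-based inclusive coordinates; the names `REGIONS`, `region_length`, `is_contiguous` and `extract` are hypothetical helpers introduced here for illustration and do not appear elsewhere in this specification.

```python
from typing import Dict, List, Tuple

# Coordinates (1-based, inclusive) on SEQ ID NO: 3, as stated in the text above.
REGIONS: Dict[str, Tuple[int, int]] = {
    "5' UTR": (1, 340),
    "NS3": (3431, 5323),
    "NS4A": (5324, 5485),
    "NS4B": (5486, 6268),
    "NS5A": (6269, 7666),
    "NS5B": (7667, 9442),
    "3' UTR": (9443, 9678),
}

# The combined NS3-to-NS5B coding sequence, as stated in the text above.
NS3_TO_NS5B: Tuple[int, int] = (3431, 9442)


def region_length(name: str) -> int:
    """Number of nucleotides in the named region."""
    start, end = REGIONS[name]
    return end - start + 1


def is_contiguous(names: List[str]) -> bool:
    """True if each named region starts right after the previous one ends."""
    return all(
        REGIONS[nxt][0] == REGIONS[prev][1] + 1
        for prev, nxt in zip(names, names[1:])
    )


def extract(sequence: str, name: str) -> str:
    """Slice the named region out of a full-length sequence numbered like SEQ ID NO: 3."""
    start, end = REGIONS[name]
    if len(sequence) < end:
        raise ValueError(f"sequence too short for region {name}")
    return sequence[start - 1:end]


ns_names = ["NS3", "NS4A", "NS4B", "NS5A", "NS5B"]
for name in ns_names:
    print(name, region_length(name))

# The five coding regions tile the NS3-to-NS5B span without gaps or overlaps.
print(is_contiguous(ns_names + ["3' UTR"]))
print(sum(region_length(n) for n in ns_names) == NS3_TO_NS5B[1] - NS3_TO_NS5B[0] + 1)
```

For a sequence other than SEQ ID NO: 3 itself, the coordinates would first have to be mapped through an alignment, which this sketch does not attempt.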
[0019] 

One embodiment of the HCV RNA-replicon according to the present
invention is a replicon RNA comprising a nucleotide sequence containing at least
the 5' untranslated region, a sequence encoding NS3 protein, NS4A protein, NS4B
protein, NS5A protein and NS5B protein, and the 3' untranslated region on the
genomic RNA of hepatitis C virus of genotype 2a. The replicon RNA may further
contain at least one selection marker gene or one reporter gene, and at least one 
IRES sequence. Furthermore, this replicon RNA may also contain a sequence 
encoding a viral protein other than NS3, NS4A, NS4B, NS5A and NS5B proteins 
on the genomic RNA of hepatitis C virus of genotype 2a. 
[0020] 

Another preferred embodiment of HCV RNA-replicon according to the 
present invention is a replicon RNA comprising a nucleotide sequence containing 
the 5' untranslated region comprising the nucleotide sequence represented by SEQ 
ID NO: 9 or 10, at least one selection marker gene or reporter gene, the IRES 
sequence, a sequence encoding NS3 protein, NS4A protein, NS4B protein, NS5A 
protein and NS5B protein on the genomic RNA of hepatitis C virus of genotype 
2a, and the 3' untranslated region comprising the nucleotide sequence represented 
by SEQ ID NO: 11 or 12. In this case, the nucleotide sequences represented by
SEQ ID NO: 9 and 10 are the sequences of the 5' untranslated regions of rSGREP-JFH1
(SEQ ID NO: 1) and rSGREP-JCH1 (SEQ ID NO: 2), respectively, which are
replicon RNAs according to the present invention. Furthermore, the nucleotide
sequences represented by SEQ ID NO: 11 and 12 are the sequences of the 3'
untranslated regions of rSGREP-JFH1 (SEQ ID NO: 1) and rSGREP-JCH1 (SEQ
ID NO: 2), respectively, which are replicon RNAs according to the present
invention.
[0021] 

A more preferred embodiment of HCV RNA-replicon according to the 
present invention is a replicon RNA comprised of an RNA comprising the 
nucleotide sequence represented by SEQ ID NO: 1 or 2. Furthermore, a replicon
RNA comprising a nucleotide sequence derived from the nucleotide sequence 
represented by SEQ ID NO: 1 or 2 by deletion, substitution or addition of 1 to 50, 
1 to 30, 1 to 10, 1 to 6, or 1 to several (2 to 5) nucleotides, and being capable of 
autonomous replication is also included in the scope of the present invention as a 

JP Application No. 2003-148242 



preferred embodiment. In the present invention, "capable of autonomous
replication" means that when replicon RNA is introduced into a cell, the replicon
RNA allows its own full-length sequence to be replicated within the cell. For
example, this ability of autonomous replication can be confirmed by transfecting
replicon RNA into Huh7 cells, culturing the Huh7 cells, extracting RNA from the
cells in the resulting culture product, and conducting Northern blot
hybridization on the extracted RNA using a probe that can specifically detect the
transfected replicon RNA so as to detect the presence of the replicon RNA.
However, such methods are not limited thereto. Specific procedures
for confirming the ability of autonomous replication can be conducted according 
to descriptions given in the Examples of this specification such as those for 
measuring the ability of colony formation, those for confirming the expression of 
HCV proteins or those for detecting replicon RNA. 
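
As an illustration of the sequence criterion only, the number of deletions, substitutions or additions separating a candidate sequence from a reference can be estimated in silico as the Levenshtein edit distance. The following is a minimal sketch under that assumption; the function names and the toy sequences are hypothetical, and the ability of autonomous replication itself can only be confirmed experimentally as described above.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: the minimum number of single-nucleotide
    deletions, substitutions or additions that turn `a` into `b`."""
    previous = list(range(len(b) + 1))
    for i, base_a in enumerate(a, start=1):
        current = [i]
        for j, base_b in enumerate(b, start=1):
            cost = 0 if base_a == base_b else 1
            current.append(
                min(
                    previous[j] + 1,         # delete base_a
                    current[j - 1] + 1,      # add base_b
                    previous[j - 1] + cost,  # substitute, or keep on a match
                )
            )
        previous = current
    return previous[-1]


def within_variant_range(reference: str, candidate: str, max_changes: int = 10) -> bool:
    """True if `candidate` differs from `reference` by 1 to `max_changes` edits."""
    return 1 <= edit_distance(reference, candidate) <= max_changes


# Toy sequences (placeholders, not SEQ ID NO: 1 or 2):
reference = "GACCUAGAUC"
candidate = "GACUUAGAC"  # one substitution (C to U) and one deletion (U)
print(edit_distance(reference, candidate))
print(within_variant_range(reference, candidate, max_changes=10))
```

The distance is the minimum number of edits, so it is a lower bound on the changes actually introduced. The dynamic-programming table grows with the product of the two sequence lengths, so for replicon-length sequences a dedicated alignment tool would normally be preferable to this pure-Python loop.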
[0022] 

In the present invention, a "selection marker gene" means a gene that can 
provide a cell with selectivity such that only the cell expressing the gene is 
selected. A general example of a selection marker gene is an antibiotic resistance 
gene. In the present invention, preferred examples of a selection marker gene 
include a neomycin resistance gene, a thymidine kinase gene, a kanamycin 
resistance gene, a pyrithiamine resistance gene, an adenylyl transferase gene, a 
Zeocin resistance gene and a puromycin resistance gene. The neomycin 
resistance gene and the thymidine kinase gene are preferred, and the neomycin 
resistance gene is more preferred. However, the selection marker gene in the 
present invention is not limited to these genes. 
[0023] 

Furthermore in the present invention, a "reporter gene" means a marker 
gene encoding a gene product that is a marker for the expression of the gene. 
General examples of a reporter gene include structural genes of enzymes that 
catalyze a light-emitting reaction or a color reaction. Preferred examples of the
reporter gene in the present invention include a transposon Tn9-derived
chloramphenicol acetyltransferase gene, an Escherichia coli-derived
β-glucuronidase or β-galactosidase gene, a luciferase gene, a green fluorescence
protein gene, an aequorin gene from jellyfish, and a secreted placental alkaline
phosphatase (SEAP) gene. However, the reporter gene in the present invention is
not limited to these genes.
[0024] 

Either one or both of the above selection marker gene and reporter
gene may be contained in the replicon RNA.
[0025] 

In the present invention, "IRES sequence" means an internal ribosome
entry site that allows translation to be initiated by binding ribosomes at an
internal position of the RNA. Preferred examples of IRES sequence in the present invention
include, but are not limited to, EMCV IRES (the internal ribosome entry site of 
encephalomyocarditis virus), FMDV IRES and HCV IRES. EMCV IRES and 
HCV IRES are more preferred, and EMCV IRES is the most preferred sequence. 
[0026] 

The replicon RNA according to the present invention may further contain a 
sequence on the genomic RNA of another HCV strain or HCV of another genotype. 
For example, the replicon RNA may also contain a fragment of HCV genome of 
genotype 1b. Examples of another HCV strain include, but are not limited to,
HCV-1, HCV-H, HC-J1, HCT-18, H77, DK-7, US11, S14, HCT23, HCV-Th, DR1,
DR4, HCT27, S18, SW1, DK9, H90, TD-6E1, S9, HCV-BK, T10, DK1, HC-J4,
HCV-J, HK3, HK8, HK5, HCV-G3, IND5, IND8, P10, D1, D3, SW2, T3, S45,
SA10, US6, HCV-JK1, HCV-JK4, HCV-JK3, HCV-JK2, HCV-JT, HC-J2, HCV-T,
HK4, HC-G9, Z1, Bi, S. I., Cho, J.M., HCV-J6, T4, T9, US10, HC-J5, T2, HC-J7,
DK11, SW3, DK8, T8, HC-J8, S83, HK2, HC-J6, HC-J8, BEBE1, HCV-J6, HCV-J8,
HD10-2, BR36-9, S52, S54, S2, BR33-1, HK10, DK12, HCV-TR, BA-1, BA-2,
DK13, Z1, Z4, Z6, Z7, HK2, SA1, SA4, SA5, SA7, SA13, SA6, NZL1, SA30,
EG-13, HCV-K3a/650, ED43, EUH1480, EUHK2, Th580, VN235, VN405, VN004,
JK049, JK046, JFH-1, JCH-1, JCH-2, JCH-3, JCH-4, JCH-5, JCH-6, J6CF and
H77.
[0027] 

The replicon RNA according to the present invention preferably has the 5' 
untranslated region on the genomic RNA of HCV of genotype 2a on the 5'-most 
side, and the 3' untranslated region on the genomic RNA of HCV of genotype 2a
on the 3'-most side. A selection marker gene or a reporter gene may be ligated 
upstream of the IRES sequence, or upstream or downstream of "the sequence 
encoding NS3 protein, NS4A protein, NS4B protein, NS5A protein and NS5B 
protein," or inserted in the middle of "the sequence encoding NS3 protein, NS4A 
protein, NS4B protein, NS5A protein and NS5B protein." 
[0028] 

The replicon RNA according to the present invention more preferably has 
the 5' untranslated region on the genomic RNA of HCV of genotype 2a on the 5'- 
most side, and a selection marker gene or a reporter gene, the IRES sequence and 
"the sequence encoding NS3 protein, NS4A protein, NS4B protein, NS5A protein 
and NS5B protein" downstream of the 5' untranslated region in this order, and the 
3' untranslated region on the genomic RNA of HCV of genotype 2a on the 3'-most 
side. 
[0029] 

Examples of the replicon RNA according to the present invention may 
include an RNA containing any foreign gene to be expressed within a cell into 
which the replicon RNA is introduced, in addition to the sequences as described 
above. A foreign gene may also be ligated downstream of the 5' untranslated 
region, or ligated upstream or downstream of a selection marker gene or a reporter 
gene, or ligated upstream or downstream of "the sequence encoding NS3 protein, 
NS4A protein, NS4B protein, NS5A protein and NS5B protein," or may be inserted 
in the middle of "the sequence encoding NS3 protein, NS4A protein, NS4B protein, 
NS5A protein and NS5B protein." A replicon RNA containing a foreign gene can 
express a protein encoded by the foreign gene when it is translated within a cell 
into which the RNA is introduced. Thus, the replicon RNA containing a foreign 
gene can be appropriately used also for gene therapy or the like, the purpose of 
which is to generate a particular gene product within a cell. 
[0030] 

The replicon RNA according to the present invention may further contain a 
ribozyme. A ribozyme is inserted to ligate a selection marker gene, a reporter 
gene or a foreign gene on the 5' side in the replicon RNA to those located on the 3' 
side thereof including the IRES sequence and "the sequence encoding NS3 protein, 
NS4A protein, NS4B protein, NS5A protein and NS5B protein," so that it enables 
cleavage and separation of the two by the self-cleavage activity of the ribozyme. 
[0031] 

In the replicon RNA according to the present invention, the above 
described selection marker gene, reporter gene, sequences encoding viral proteins 
on the genomic RNA of hepatitis C virus of genotype 2a, sequences encoding viral 
proteins of HCV of a genotype other than genotype 2a, a foreign gene or the like 
are ligated so that they are translated from the replicon RNA in the correct reading 
frame. Among these sequences, the protein-coding sequences may be ligated to 
each other via a protease cleavage site and the like, so that after the proteins are 
expressed as a fusion protein with the polyprotein that is translated from "the 
sequence encoding NS3 protein, NS4A protein, NS4B protein, NS5A protein and 
NS5B protein" of hepatitis C virus of genotype 2a, the fusion protein is separated 
by protease into each protein. 
[0032] 

2. Preparation of replicon RNA according to the present invention 

The HCV RNA-replicon according to the present invention can be prepared 
using any genetic engineering techniques known by persons skilled in the art. 
The HCV RNA-replicon can be prepared by, for example, the following method, 
but the method of preparation is not limited thereto. 
[0033] 

First, DNA corresponding to the entire region of the genomic RNA of 
hepatitis C virus of genotype 2a is ligated downstream of an RNA promoter 
according to a standard procedure so as to prepare a DNA clone. As used herein, 
"DNA corresponding to RNA" means a DNA having a nucleotide sequence derived 
from the nucleotide sequence of the RNA by substituting U (uracil) with T 
(thymine). The above RNA promoter is preferably an RNA promoter contained in 
a plasmid clone. The RNA promoter is not limited to a particular one, but the T7 RNA 
promoter is particularly preferred. 
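
The definition of "DNA corresponding to RNA" given above amounts to a single character substitution. The following is a minimal Python sketch of that substitution, assuming sequences are handled as plain strings of A, C, G and U (or T). The function names and the short test fragment are illustrative assumptions and are not part of this specification.

```python
def rna_to_dna(rna: str) -> str:
    """Return the DNA corresponding to an RNA sequence (U replaced by T)."""
    seq = rna.upper()
    invalid = set(seq) - set("ACGU")
    if invalid:
        raise ValueError(f"unexpected RNA symbols: {sorted(invalid)}")
    return seq.replace("U", "T")


def dna_to_rna(dna: str) -> str:
    """Return the RNA sequence read off a sense-strand DNA (T replaced by U)."""
    seq = dna.upper()
    invalid = set(seq) - set("ACGT")
    if invalid:
        raise ValueError(f"unexpected DNA symbols: {sorted(invalid)}")
    return seq.replace("T", "U")


if __name__ == "__main__":
    # Hypothetical short fragment, used only to illustrate the substitution.
    print(rna_to_dna("ACCUGCUU"))
```

The two functions are inverses of each other on valid input, which mirrors the relation between the DNA clone and the RNA synthesized from it.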
[0034] 

Next, for the thus prepared DNA clone, for example, the structural region 
(Core sequence, E1 sequence and E2 sequence) located downstream of the 5' 
untranslated region and the sequence encoding NS2 protein are substituted with a 
DNA fragment containing a selection marker gene or a reporter gene and the IRES 
sequence ligated downstream thereof. In this substitution, portions other than the 
structural region, such as a fragment on the 3' terminal side of the 5' untranslated 
region or a part of the sequence encoding NS3 protein may be substituted with a 
sequence derived from HCV of another genotype. 
[0035] 

Subsequently, using the DNA clone after the substitution as a template, 
RNA is synthesized using RNA polymerase. RNA synthesis can be initiated by a 
standard procedure from the 5' untranslated region and the IRES sequence. When 
a template DNA is a plasmid clone, the above DNA region ligated downstream of 
an RNA promoter is excised by a restriction enzyme from the plasmid clone, and 
then RNA can be synthesized using the DNA fragment as a template. In addition, 
preferably the 3' terminus of RNA to be synthesized agrees with the 3' untranslated 
region of the viral genomic RNA, and no other sequences are added or deleted. 
The thus synthesized RNA is the replicon RNA according to the present invention. 
[0036] 

3. Preparation of replicon-replicating cells into which replicon RNA from HCV of 
genotype 2a is introduced 

The replicon RNA that is prepared as described above is introduced into 
cells in which the replicon RNA should be replicated, so that cells wherein the 
replicon RNA is continuously replicated can be obtained. In this specification, a 
cell wherein replicon RNA is continuously replicated is referred to as a "replicon- 
replicating cell." 
[0037] 

As a cell into which replicon RNA is introduced, any cell can be used, as 
long as it can be subcultured. Such a cell is preferably a eukaryotic cell, more 
preferably a human liver-derived cell, and further preferably Huh7 cells. As 
these cells, commercially available cells may be utilized, these cells may be 
obtained from cell depositories, or cell lines established from any cells (e.g., 
cancer cells or stem cells) may also be used. 
[0038] 

Introduction of replicon RNA into cells can be performed using any 
technique known by persons skilled in the art. Examples of such an introduction 
method include electroporation, a particle gun method, a lipofection method, a 
calcium phosphate method, a microinjection method and a DEAE sepharose 
method. The method using electroporation is particularly preferred. 
[0039] 

A replicon RNA of interest may be introduced alone, or may be introduced 
after it is mixed with other nucleic acids. To vary the quantity of replicon RNA 
while keeping RNA quantity to be introduced at a certain level, the replicon RNA 
of interest is mixed with total cellular RNA extracted from cells into which the 
RNA is introduced, and then the mixture is used for introduction into cells. The 
quantity of replicon RNA to be used for introduction into cells may be determined 
depending on the introduction method employed, and is preferably between 1 
picogram and 100 micrograms, and more preferably between 10 picograms and 10 
micrograms. 
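
The mixing scheme described above, in which the replicon RNA dose varies while the total RNA introduced stays fixed, can be sketched as a small calculation. This is a hedged illustration only: the 10 microgram default mirrors the total RNA quantity used in Example 2, while the function names and the dose list are hypothetical.

```python
def carrier_rna_ug(replicon_ug: float, total_ug: float = 10.0) -> float:
    """Mass of total cellular RNA (ug) needed to bring the mix up to total_ug."""
    if total_ug <= 0:
        raise ValueError("total RNA quantity must be positive")
    if not 0 <= replicon_ug <= total_ug:
        raise ValueError("replicon quantity must lie between 0 and total_ug")
    return total_ug - replicon_ug


def mixing_plan(replicon_doses_ug, total_ug: float = 10.0):
    """Return (replicon_ug, carrier_ug) pairs, one per replicon dose."""
    return [(dose, carrier_rna_ug(dose, total_ug)) for dose in replicon_doses_ug]


if __name__ == "__main__":
    # Hypothetical dose series; every mix sums to the same total quantity.
    for replicon, carrier in mixing_plan([0.001, 0.01, 0.1, 1.0]):
        print(f"replicon {replicon:g} ug + cellular RNA {carrier:g} ug")
```

Holding the total constant keeps the conditions of introduction comparable across doses, so that differences in colony number can be attributed to the replicon quantity.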

[0040] 

When replicon RNA containing a selection marker gene or a reporter gene 
is used for introduction into cells, cells wherein the replicon RNA is introduced 
and continuously replicated can be selected utilizing the expression of the 
selection marker gene or the reporter gene. Specifically, for example, such cells 
into which replicon RNA has been introduced may be cultured in media whereby 
the cells can be selected by the expression of the selection marker gene or the 
reporter gene. As an example, when replicon RNA contains a neomycin 
resistance gene as a selection marker gene, cells into which replicon RNA has 
been intracellularly introduced are seeded into a culture dish. After 16 to 24 
hours of culture, G418 (neomycin) is added to the culture dish at a concentration 
of 0.05 milligrams/milliliter to 3.0 milligrams/milliliter. The cells are 
continuously cultured for preferably 10 days to 40 days and more preferably 14 
days to 28 days after seeding, while exchanging the culture solution twice a week. 
Next, surviving cells are stained with crystal violet, so that cells into which the 
replicon RNA has been introduced and is being continuously replicated can be 
selected as formed colonies. 
[0041] 

Cloned cells can be obtained from the formed colonies by cloning 
surviving cells by a standard procedure, and then continuing the culture of the 
cells. The thus obtained cell clone wherein the replicon RNA of interest is 
continuously replicated is referred to as "a replicon-replicating cell clone" in this 
specification. 
[0042] 

Regarding the established cell clone, detection of a replicon RNA that has 
been replicated from the introduced replicon RNA in the cell clone, confirmation 
of the presence or the absence of the incorporation of a selection marker gene or a 
reporter gene in the introduced replicon RNA into a host genomic DNA, and 
confirmation of the expression of an HCV protein are preferably carried out to 
confirm the fact that a replicon RNA of interest is actually and continuously 
replicated. 
[0043] 

A replicon RNA that has been replicated from the introduced replicon RNA 
in the cell clone (in this specification, hereinafter conveniently referred to as 
"replicated replicon RNA") may be detected according to any RNA detection 
method known by persons skilled in the art. For example, detection can be 
performed by conducting the Northern hybridization method for total RNA 
extracted from the cell clone using as a probe a DNA fragment specific to the 
introduced replicon RNA. 
[0044] 

Furthermore, the presence or the absence of the incorporation of a 
selection marker gene or a reporter gene in the introduced replicon RNA into a 
host genomic DNA can be confirmed by, for example, performing PCR for the host 
genomic DNA extracted from the cell clone to amplify at least a part of the 
selection marker gene or the reporter gene, and then confirming the presence or 
the absence of the amplified product. However, examples of relevant methods 
are not limited thereto. A cell clone for which the amplified product is confirmed 
is considered to have a selection marker gene or a reporter gene incorporated in 
the host genome. Thus, regarding the cell clone, the replicon RNA itself may not 
be continuously replicated within the cell. In this case, whether or not the 
replicon RNA is continuously replicated can be confirmed by conducting an 
experiment to confirm the expression of an HCV protein, as described below. 
[0045] 

The expression of an HCV protein can be confirmed by, for example, 
reacting an antibody against an HCV protein that is to be expressed from the 
introduced replicon RNA with a protein extracted from the cell clone. This 
method can be conducted by any protein detection method known by persons 
skilled in the art. Specifically, for example, a protein sample extracted from the 
cell clone is blotted onto a nitrocellulose membrane, with which an anti-HCV 
protein antibody (e.g., an anti-NS3-specific antibody or an antiserum collected 
from a hepatitis C patient) is reacted, and then the anti-HCV protein antibody is 
detected. If the HCV protein is detected among proteins extracted from the cell 
clone, it can be concluded that this cell clone continuously replicates HCV-derived 
replicon RNA to express the HCV protein. 
[0046] 

As described above, cell clones confirmed to continuously replicate a 
replicon RNA of interest (replicon-replicating cell clones) can be obtained. 
Furthermore in the present invention, replicon RNA can be obtained by any 
method known by persons skilled in the art, for example, by extracting RNA from 
the replicon-replicating cell, and then separating replicon RNA from the RNA by 
an electrophoresis method. The present invention also relates to such a method 
of producing replicon RNA. Moreover, preferably, the replicon-replicating cell 
according to the present invention can be used for producing HCV proteins. 
Persons skilled in the art can obtain HCV proteins from the replicon-replicating 
cells according to any standard method. Specifically, for example, a viral protein 
of hepatitis C virus of genotype 2a can be produced by culturing replicon- 
replicating cells, collecting proteins from the resulting culture product (including 
cultured cells and culture media) by a standard procedure, and then selectively 
obtaining viral proteins from the proteins by detection or the like using an anti- 
HCV protein antibody. 
[0047] 

Moreover, when the replicon-replicating cell according to the present 
invention continuously replicates replicon RNA containing a foreign gene, a 
protein encoded by the foreign gene can be obtained by the expression thereof 
using the replicon-replicating cell. Specifically, for example, the protein encoded 
by a foreign gene can be obtained by culturing replicon-replicating cells, 
collecting proteins from the resulting culture product (including cultured cells and 
culture media) by a standard procedure, and then selectively obtaining the protein 
from among the proteins by detection or the like using an antibody against the 
protein of interest. 
[0048] 

4. Introduction of mutation that increases replication efficiency into replicon RNA 
from HCV of genotype 2a 

A mutation that enhances replication efficiency frequently takes 
place in the replicon RNA that is replicated or generated in the replicon- 
replicating cell (replicated replicon RNA) according to the present invention. 
Such a mutation may be an adaptive mutation. 
[0049] 

Utilizing this fact, introduction of a mutation enhancing replication 
efficiency into the replicon RNA according to the present invention can be 
promoted in the present invention. 
[0050] 

Specifically, the step comprising obtaining a first replicated replicon RNA 
by extraction or the like from a first replicon-replicating cell (preferably, a 
replicon-replicating cell, wherein the replicon RNA according to the present 
invention has been introduced), and then re-introducing the first replicated 
replicon RNA into another cell to prepare a second replicon-replicating cell is 
performed repeatedly once or more, preferably 1 to 10 times, more preferably 1 to 
5 times, and further preferably 1 to 2 times, so that the mutation increasing 
replication efficiency can be introduced at a high frequency into the replicon RNA 
within the replicon-replicating cells. 
[0051] 

As a cell into which a replicated replicon RNA is re-introduced, any cell 
can be used. Such a cell is preferably derived from a biological species that is 
the same as that of a cell wherein replicon RNA is introduced at the beginning, 
more preferably derived from the same tissue derived from the same biological 
species as that of a cell wherein replicon RNA is introduced at the beginning, and 
further preferably of a cell line that is the same as that for a cell wherein replicon 
RNA is introduced at the beginning. 
[0052] 

Therefore in the present invention, using the above method, replicon RNA 
wherein the mutation increasing replication efficiency is introduced can be 
produced. Specifically, the step comprising obtaining a first replicated replicon 
RNA by extraction or the like from a first replicon-replicating cell (preferably, a 
replicon-replicating cell, into which the replicon RNA according to the present 
invention has been introduced), and then re-introducing the first replicated 
replicon RNA into another cell so as to prepare a second replicon-replicating cell 
is performed repeatedly once or more, preferably 1 to 10 times, more preferably 1 
to 5 times, and further preferably 1 to 2 times. Subsequently, the replicated 
replicon RNA is obtained by extraction or the like from the replicon-replicating 
cell finally obtained at the end of the repeated steps, so that replicon RNA with 
increased replication efficiency can be produced. 
[0053] 

In the present invention, the replication efficiency of a replicon RNA can 
be increased at least 2 times, preferably 10 to 100 times, and more preferably 100 
to 10000 times by the above method. 
[0054] 

Regarding the replicon RNA that is produced by such a method so as to 
have increased replication efficiency, the nucleotide sequence is preferably 
determined by a known method, for example, by obtaining cDNA by reverse 
transcription PCR and subjecting such cDNA to sequencing. Furthermore, the 
thus determined nucleotide sequence or the amino acid sequence encoded by the 
nucleotide sequence is compared with the nucleotide sequence of replicon RNA 
that had been introduced at the beginning into cells, so that adaptive mutation can 
be identified. As adaptive mutation increasing replication efficiency, in 
particular, nonsynonymous substitution that mutates an amino acid in a viral 
protein encoded by replicon RNA is preferred. 
[0055] 

The present invention also provides a method whereby the replicon RNA 
of hepatitis C virus of genotype 2a having increased replication efficiency can be 
produced by introducing the thus identified adaptive mutation into replicon RNA, 
the replication efficiency of which is to be increased, by a standard procedure. 
[0056] 

The replicon RNA that is produced as described above so as to have 
increased replication efficiency can be used for producing replicon RNA in large 
quantity within cells that have been used for the method. 

[0057] 

The replication efficiency of the replicon RNA according to the present 
invention can be determined by a method known by persons skilled in the art. 
For example, it can be determined according to the following method. Replicon 
RNAs are transfected in quantities of 0.0001, 0.0003, 0.001, 0.003, 0.01, 0.03, 0.1, 
0.3 and 1.0 micrograms, respectively, into Huh7 cells, selective culture with G418 
is performed for 21 days in a method similar to the above experimental techniques, 
and then the number of colonies formed (number of colonies) is counted. The 
quantity of RNA introduced is compared with the number of colonies formed to 
determine the range of the quantity of the replicon RNA introduced, within which 
colony formation increases in a quantity-dependent manner. The number of 
colonies formed within the range is divided by the quantity of RNA introduced, 
and the resulting value is regarded as the colony forming activity per microgram. 
This equation is as follows. 

Colony forming activity [(Colony Forming Unit, or CFU)/microgram] = 
Number of colonies formed [colony] / quantity of RNA introduced [microgram] 
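
The equation above can be written out as a short calculation. The following Python sketch is illustrative only: the function names are hypothetical, the colony counts are invented placeholder values rather than data from this specification, and averaging over several doses in the quantity-dependent range is an assumed way of summarizing the result that the text does not prescribe.

```python
def colony_forming_activity(colonies: int, rna_ug: float) -> float:
    """Colony forming activity in CFU per microgram of introduced RNA."""
    if rna_ug <= 0:
        raise ValueError("quantity of RNA introduced must be positive")
    if colonies < 0:
        raise ValueError("number of colonies cannot be negative")
    return colonies / rna_ug


def mean_activity(points) -> float:
    """Average CFU/ug over (rna_ug, colonies) pairs from the linear range."""
    values = [colony_forming_activity(colonies, rna_ug) for rna_ug, colonies in points]
    if not values:
        raise ValueError("at least one data point is required")
    return sum(values) / len(values)


if __name__ == "__main__":
    # Hypothetical counts within the quantity-dependent range.
    linear_range = [(0.001, 95), (0.003, 280), (0.01, 950)]
    print(f"{mean_activity(linear_range):.0f} CFU/ug")
```

Only doses inside the range where colony number rises with RNA quantity should be passed in, since outside that range the ratio no longer reflects replication efficiency.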
[0058] 

The thus calculated colony forming activity is regarded as a value 
representing the replication efficiency of replicon RNA introduced. Specifically, 
the higher the colony forming activity, the higher the replication efficiency of the 
replicon RNA. 
[0059] 

In addition, the replication efficiency of replicon RNA can also be shown 
via a colony-forming ability that is represented by the number of copies of the 
replicon RNA introduced per formed colony. That is, in this case, the ability can 
be calculated according to the following equation. 

Colony forming ability = number of copies of replicon RNA introduced 
[copy] / number of formed colonies [colony] 
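
To apply this second equation, the mass of RNA introduced first has to be converted into a copy number. The sketch below does this with a commonly used approximation for the molar mass of single-stranded RNA (about 320.5 g/mol per nucleotide plus about 159 g/mol for the ends). The approximation, the function names, the 8000-nucleotide length and the colony count are all assumptions chosen for illustration and are not values from this specification.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def rna_copies(mass_ug: float, length_nt: int,
               nt_mw: float = 320.5, end_mw: float = 159.0) -> float:
    """Approximate number of single-stranded RNA molecules in mass_ug."""
    if mass_ug < 0 or length_nt <= 0:
        raise ValueError("mass must be non-negative and length positive")
    molar_mass = length_nt * nt_mw + end_mw  # g/mol, approximate
    return mass_ug * 1e-6 / molar_mass * AVOGADRO


def colony_forming_ability(copies_introduced: float, colonies: int) -> float:
    """Copies of replicon RNA introduced per formed colony."""
    if colonies <= 0:
        raise ValueError("at least one colony is required")
    return copies_introduced / colonies


if __name__ == "__main__":
    # Hypothetical inputs: 0.01 ug of an 8000-nt RNA yielding 950 colonies.
    copies = rna_copies(0.01, 8000)
    print(f"{colony_forming_ability(copies, 950):.3g} copies per colony")
```

Under this definition a smaller value indicates a higher replication efficiency, because fewer introduced copies are needed to give one colony.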
[0060] 

5. Other embodiments of the present invention 

The replicon RNA-replicating cell according to the present invention can 
also be used as a test system for, for example, screening for a substance that 
promotes or suppresses the replication of hepatitis C virus. Specifically, for 
example, replicon replicating cells are cultured in the presence of a test substance, 
replication of the replicon RNA in the resulting culture product is detected, and 
then whether or not the test substance promotes or suppresses the replication of 
the replicon RNA is determined, so that a substance that promotes or suppresses 
the replication of hepatitis C virus can be screened for. In this case, detection of 
the replication of the replicon RNA in the resulting culture product may be 
conducted by detecting the quantity of, or the presence or the absence of, the 
replicon RNA in the RNAs extracted from the replicon RNA-replicating cell, or by 
detecting the quantity of, or the presence or the absence of, HCV protein contained 
in the proteins in the culture product or in the replicon RNA-replicating cells 
contained in the culture product. 
[0061] 
agent or a therapeutic agent for hepatitis C virus infection. 

(7) Production of a vaccine antigen that can be used as a vaccine against HCV of 
genotype 2a. 

(8) Production of hepatic cell-directed genetic vector that is used after the 
incorporation of a foreign gene therein for gene therapy. 

[0064] 
[Examples] 

The present invention will be described more specifically based on the 
following examples and drawings. However, the technical scope of the present 
invention is not limited by these examples. 

[0065] 

[Example 1] Preparation of replicon RNA 
(A) Construction of expression vector 

DNA corresponding to the entire region of viral genome of hepatitis C 
virus JFH-1 strain (genotype 2a) that had been separated from patients with 
fulminant hepatic failure was obtained from a JFH-1 clone containing the full- 
length genomic cDNA of the virus strain. The DNA was inserted downstream of 
T7 RNA promoter sequence that had been inserted in pUC19 plasmid. The thus 
constructed plasmid DNA is hereinafter referred to as pJFH1. Similarly, DNA 
corresponding to the entire region of viral genome of hepatitis C virus JCH-1 
strain (genotype 2a) that had been separated from patients with chronic hepatitis 
was obtained from a JCH-1 clone containing the full-length genomic cDNA of the 
virus strain. The DNA was inserted downstream of the T7 RNA promoter 
sequence that had been inserted in pUC19 plasmid. The thus constructed plasmid 
DNA is hereinafter referred to as pJCH1. In addition, the preparation of the 
above JFH-1 clone and JCH-1 clone is described in Patent Literature 1 and Non 
Patent Literature 3. Moreover, the nucleotide sequence of the full-length cDNA 
of JFH-1 clone was registered at the International DNA Data Bank 
(DDBJ/EMBL/GenBank) under accession No. AB047639, and the nucleotide 
sequence of the full-length cDNA of the JCH-1 clone under accession No. 
AB047640. 

[0066] 

The structures of the thus constructed plasmid DNA pJFH1 and pJCH1 are 
shown in the upper section of Fig. 1. "T7" represents T7 RNA promoter, and "G" 
represents dGTP inserted upstream of the 5' end of the inserted JFH-1- or JCH-1- 
derived DNA and downstream of the 3' end of T7 RNA promoter sequence. A 
region from "5' NTR" to "3' NTR" is DNA corresponding to the entire genomic 
region of hepatitis C virus. 
[0067] 

Next, the structural regions and a part of the non-structural regions of 
plasmid DNA pJFH1 and pJCH1 were substituted with a neomycin resistance gene 
(neo; also referred to as a neomycin phosphotransferase gene) and EMCV-IRES 
(internal ribosome entry site of encephalomyocarditis virus), thereby constructing 
plasmid DNA pSGREP-JFH1 and pSGREP-JCH1, respectively (lower section of 
Fig. 1). This construction procedure was conducted according to a previous 
report (Non Patent Literature 7). Specifically, plasmids pJFH1 and pJCH1 were 
cleaved with restriction enzymes Age I and Cla I, and the following fragments 
were inserted and ligated between the Age I and Cla I restriction sites: a fragment 
prepared by joining a sequence ranging from the 5' NTR to the Core region 
derived from pJFH1 with the neomycin resistance gene derived from pRSV5NEO 
by PCR amplification and then cleaving the product with restriction enzymes 
Age I and Pme I; and a fragment prepared by joining sequences ranging from the 
EMCV IRES to the NS3 region by PCR amplification and then cleaving the 
product with restriction enzymes Pme I and Cla I. 
[0068] 

Moreover, a mutation that mutates an amino acid motif GDD to GND, 
corresponding to the active center of RNA polymerase encoded by the NS5B 
region, was introduced into the NS5B region in pSGREP-JFH1, thereby preparing 
a mutant plasmid clone pSGREP-JFH1/GND. 
[0069] 

Moreover, a mutation that results in the deletion of a sequence of 10 
continuous amino acids containing an amino acid motif GDD corresponding to the 
active center of RNA polymerase encoded by the NS5B region was introduced into 
the NS5B region in pSGREP-JFH1, thereby preparing a mutant plasmid clone 
pSGREP-JFH1/dGDD. 
[0070] 

The above-prepared mutant clones pSGREP-JFH1/GND and 
pSGREP-JFH1/dGDD cannot express active NS5B protein, which is required for the 
replication of replicon RNA, because the amino acid sequence of the active site of 
NS5B protein encoded by these clones has mutated. 
[0071] 

(B) Preparation of replicon RNA 

To prepare template DNA for use in synthesis of replicon RNA, the above- 
constructed expression vectors pSGREP-JFH1, pSGREP-JCH1, 
pSGREP-JFH1/GND and pSGREP-JFH1/dGDD were each cleaved with a restriction enzyme 
Xba I. 
[0072] 

Subsequently, 10 to 20 µg each of these Xba I-cleaved fragments was 
contained in 50 µl of a reaction solution, and then further treated by 30 minutes of 
incubation at 30°C with 20 U of Mung Bean Nuclease. Mung Bean Nuclease is 
an enzyme catalyzing a reaction for selectively degrading a single-stranded 
portion of double-stranded DNA. Generally, when RNA synthesis is performed 
using directly the above Xba I-cleaved fragment as a template, a replicon RNA 
having four nucleotides of CUGA, a part of the recognition sequence of Xba I, 
excessively added to the 3' terminus would be synthesized. Hence, in this 
example, Xba I-cleaved fragments were treated with Mung Bean Nuclease, so as to 
remove the four nucleotides of CUGA from the fragments. The solutions 
containing Xba I-cleaved fragments, which had been treated with Mung Bean 
Nuclease, were treated to remove proteins according to a general method, so that 
Xba I-cleaved fragments, from which the four nucleotides of CUGA had been 
removed, were purified and used as template DNAs. 
[0073] 

Next, from the template DNA, RNA was synthesized in vitro using T7 
RNA polymerase. For this RNA synthesis, MEGAscript from Ambion, Inc. was 
used. Reaction was carried out using 20 µl of a reaction solution containing 0.5 
to 1.0 micrograms of the template DNA according to the instructions of the 
manufacturer. 
[0074] 

After completion of RNA synthesis, DNase (2 U) was added to the reaction 
solution to conduct reaction at 37°C for 15 minutes. RNA extraction using acidic 
phenol was further performed to remove the template DNA. RNAs (replicon 
RNAs) synthesized in this manner from the above template DNAs derived from 
pSGREP-JFH1, pSGREP-JCH1, pSGREP-JFH1/GND and pSGREP-JFH1/dGDD 
were respectively named rSGREP-JFH1, rSGREP-JCH1, rSGREP-JFH1/GND and 
rSGREP-JFH1/dGDD. Regarding the nucleotide sequences of these replicon 
RNAs, the nucleotide sequence of rSGREP-JFH1 is shown in SEQ ID NO: 1 and 
Fig. 2, that of rSGREP-JCH1 is shown in SEQ ID NO: 2 and Fig. 3, that of 
rSGREP-JFH1/GND is shown in SEQ ID NO: 7, and that of rSGREP-JFH1/dGDD 
is shown in SEQ ID NO: 8. 
[0075] 

[Example 2] Establishment of replicon-replicating cell clone 

(C) Transfection of replicon RNA, determination of colony-forming ability of 
transfected cells and establishment of cell clones 

Each of the above-synthesized replicon RNAs (rSGREP-JFH1, rSGREP-JCH1, 
rSGREP-JFH1/GND and rSGREP-JFH1/dGDD) was mixed in different 
quantities with total cellular RNA extracted from Huh7 cells so as to have a total 
RNA quantity of 10 µg. Subsequently, the mixed RNA was introduced into Huh7 
cells by the electroporation method. The Huh7 cells subjected to the 
electroporation treatment were seeded into culture dishes, and then cultured for 16 
hours to 24 hours. G418 (neomycin) was then added to the culture dishes at 
different concentrations. Thereafter, culture was continued while exchanging the 
culture solutions twice a week. After 21 days of culture following seeding, 
surviving cells were stained with crystal violet. The number of stained colonies 
was counted, and then the number of colonies obtained per µg of the transfected 
replicon RNA was calculated. 
[0076] 

For rSGREP-JFH1- or rSGREP-JCH1-transfected cells, for which colony 
formation had been observed, colonies of the surviving cells were further cloned 
from the above culture dishes after 21 days of culture, and were continuously 
cultured. By such cloning of colonies, several strains of cell clones could be 
established. 
[0077] 

For the established cell clones, detection of the replicated replicon RNA, 
confirmation of the presence or the absence of the incorporation of the neomycin 
resistance gene into the host genomic DNA, and confirmation of the expression of 
HCV proteins were performed as described later, in Example 4. Cell clones for 
which the replication of the replicon had been confirmed in the cells were 
regarded as replicon-replicating cell clones. 
[0078] 

(D) Colony-forming ability in each transfected cell 

As a result of the above transfection, for rSGREP-JFH1-transfected Huh7 
cells, the colony-forming ability per µg of the transfected replicon RNA was 
94700 CFU (Colony Forming Unit)/µg-RNA when G418 concentration was 1.0 
mg/ml (the left column in Fig. 4). In contrast, colony formation was not 
observed in the Huh7 cells, into which rSGREP-JFH1/dGDD and 
rSGREP-JFH1/GND had each been transfected (the central column and the right column in 
Fig. 4). This suggests that the colony-forming ability confirmed for the Huh7 
cells, into which rSGREP-JFH1 replicon RNA had been transfected, depends on 
the activity of NS5B (RNA polymerase) expressed by rSGREP-JFH1. 
Specifically, it was considered that in cells that had formed colonies, 
rSGREP-JFH1 replicon RNA autonomously replicated due to the action of NS5B expressed 
by rSGREP-JFH1, and the neomycin resistance gene was continuously expressed 
to maintain G418 resistance, so that cell growth was enabled. 
[0079] 

On the other hand, in the Huh7 cells, into which rSGREP-JCH1 had been 
transfected, no colony formation was observed at G418 concentrations of 1.0 to 
0.5 mg/ml (Fig. 5). When G418 concentration was lowered to 0.25 mg/ml, 
colony formation was observed in the Huh7 cells, into which rSGREP-JCH1 had 
been transfected as well. 
[0080] 

Furthermore, the Xba I-cleaved fragment of the expression vector 
pSGREP-JFH1 obtained in (B) above was used as a template DNA for RNA synthesis 
without treating the fragment with Mung Bean Nuclease, so as to synthesize 
replicon RNA. This replicon RNA was transfected to Huh7 cells in a manner 
similar to that in (C) above. The replicon RNA that had been prepared without 
performing Mung Bean Nuclease treatment had the four nucleotides of CUGA 
excessively added to the 3' terminus. 
[0081] 

As a result, the colony-forming ability of the Huh7 cells, into which the 
replicon RNA prepared without treatment with Mung Bean Nuclease had been 
transfected, decreased to 512 CFU/µg-RNA (the left side in Fig. 6). This result 
revealed that the sequence on the 3' terminus of the replicon RNA affects the 
colony-forming ability of the transfected cells. 
[0082] 
[Example 3]

(E) Re-transfection of replicated replicon RNA derived from replicon-replicating
cells

From the replicon-replicating cell clones that had been established by
transfection of rSGREP-JFH1 into Huh7 cells as described in Example 2, total
RNA was extracted by a standard procedure. The number of copies of the
replicated replicon RNA contained in the cellular RNA was determined by
Northern blot analysis and a quantitative RT-PCR method.
[0083] 

Northern blot analysis was performed according to the description in
Molecular Cloning, A Laboratory Manual, 2nd edition, J. Sambrook, E. F. Fritsch,
T. Maniatis, Cold Spring Harbor Laboratory Press (1989). Specifically, RNA
extracted from the cells was subjected to denaturing agarose electrophoresis.
After electrophoresis, the RNA was transferred onto a positively charged nylon
membrane. A 32P-labeled DNA or RNA probe prepared from pSGREP-JFH1
was hybridized to the RNA transferred to the membrane as described above.
Next, the membrane was washed and then exposed to a film, so as to detect a
replicon-specific RNA band.
[0084] 

Detection of the replicon RNA by quantitative RT-PCR was conducted by
detecting the 5' untranslated region RNA within HCV RNA according to Takeuchi
T, Katsume A, Tanaka T, Abe A, Inoue K, Tsukiyama-Kohara K, Kawaguchi R,
Tanaka S and Kohara M., Real-time detection system for quantification of
Hepatitis C virus genome, Gastroenterology 116: 636-642 (1999). Specifically,
the replicon RNA contained in RNA extracted from the cells was amplified by
PCR using the synthetic primers R6-130-S17, 5'-CGGGAGAGCCATAGTGG-3' (SEQ
ID NO: 13), and R6-290-R19, 5'-AGTACCACAAGGCCTTTCG-3' (SEQ ID NO: 14);
the TaqMan probe R6-148-S21FT, 5'-CTGCGGAACCGGTGAGTACAC-3' (SEQ
ID NO: 15); and an EZ rTth RNA PCR kit, and then detected using an ABI Prism
7700 sequence detector system.
[0085] 

Next, aliquots of total cellular RNAs extracted from clone 6 (among the
above-mentioned replicon-replicating cell clones) and pool clones (prepared by
collecting replicon-replicating cells that had formed colonies from one whole
dish and culturing them) were each introduced into other Huh7 cells by
retransfection. Total cellular RNA used for the transfection was prepared to
contain 1x10^7 copies of replicon RNA based on the number of copies of replicon
RNA determined above. Transfection was performed as described in (C) above,
and then selective culture was performed at a G418 concentration of 1 mg/ml.
Thus, the colony formation of the replicon-replicating cells was observed
(Fig. 7). The colony-forming ability in this case was 1 colony or more per
1x10^6 copies of the replicon RNA used for transfection, when calculated from
the number of colonies obtained.
[0086] 

On the other hand, the number of copies of in vitro synthetic RNA that had
been synthesized using pSGREP-JFH1 as a template and T7 RNA polymerase was
approximately 2x10^11 copies/µg-RNA, when calculated based on the weight and
the length of the RNA. The colony-forming ability in the case of using the in
vitro synthetic RNA for transfection in a manner similar to the above method
was 1 colony per 5x10^7 copies. These results revealed that when cellular RNA
extracted from replicon-replicating cells and in vitro synthetic RNA were each
transfected into Huh7 cells as replicon RNA in the same number of copies, the
use of the replicon RNA replicated within Huh7 cells resulted in
colony-forming ability approximately 50 times higher than that of the in vitro
synthetic RNA.
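
The copy number per unit weight of in vitro synthetic RNA can be reproduced
with a short calculation. The following is only a sketch: it assumes an average
mass of about 340 g/mol per ribonucleotide (a common approximation that is not
stated in this description) and uses the length of 8024 nucleotides given for
SEQ ID NO: 1. The function and constant names are illustrative.

```python
AVOGADRO = 6.022e23      # molecules per mole
AVG_NT_MASS = 340.0      # g/mol per ribonucleotide (assumed average value)

def rna_copies_per_microgram(length_nt, avg_nt_mass=AVG_NT_MASS):
    """Estimate the number of RNA molecules contained in 1 microgram of RNA."""
    molar_mass = length_nt * avg_nt_mass   # g/mol for the whole molecule
    moles = 1e-6 / molar_mass              # 1 microgram expressed in moles
    return moles * AVOGADRO

copies = rna_copies_per_microgram(8024)    # length of SEQ ID NO: 1
print(f"{copies:.1e} copies per microgram")
```

Under these assumptions the estimate is on the order of 10^11 copies per µg of
RNA; a slightly different average nucleotide mass changes the result only
marginally.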
[0087] 
[Example 4]

(F) Detection of replicon RNA

According to (E) above, cell clones (clones Nos. 1 to 11) were established
by retransfecting other Huh7 cells with total RNA that had been obtained from
the replicon-replicating cell clone established by transfection of rSGREP-JFH1
into Huh7 cells. From the established cell clones and pool clones (prepared
by collecting cell clones that had formed colonies from one whole dish and then
culturing them), total RNAs were extracted by an acidic phenol extraction
method. Subsequently, the total RNAs were analyzed by the Northern blot method
using a pSGREP-JFH1-specific probe. As controls, total RNA extracted similarly
from untransfected Huh7 cells (in Fig. 8, denoted as "Huh7"), a sample prepared
by adding 10^7 copies of replicon RNA synthesized in vitro to the total RNA
extracted from Huh7 cells (in Fig. 8, denoted as "10^7"), and a sample (in
Fig. 8, denoted as "10^8") prepared by adding 10^8 copies of replicon RNA
synthesized in vitro to the total RNA extracted from Huh7 cells, were used.
In Fig. 8, 1 to 11 represent cell clone numbers.
[0088] 

As a result, RNA of approximately the same size as that of rSGREP-JFH1
was detected using the pSGREP-JFH1-specific probe (Fig. 8). Thus, it was
confirmed that the replicon RNA from rSGREP-JFH1 that had been transfected at
the beginning replicated and proliferated within the cell clones. In addition,
it was shown that the cell clones differed from each other in the quantity of
the replicated replicon RNA. In Fig. 8, for example, clones 2, 6, 9 and 10
contained high quantities of the replicated replicon RNA, and clones 4, 8 and
11 contained low quantities of the replicated replicon RNA.
[0089] 

(G) Confirmation of the presence or the absence of the incorporation of a
neomycin resistance gene into genomic DNA

For the cell clones that had been obtained by retransfection of replicon
RNA as described in Example 3, PCR amplification was performed using
neomycin resistance gene-specific primers (sense primer NEO-S3: 5'-
AACAAGATGGATTGCACGCA-3' (SEQ ID NO: 16) and antisense primer NEO-R:
5'-CGTCAAGAAGGCGATAGAAG-3' (SEQ ID NO: 17)) and the host cellular
genomic DNA extracted from each of the cell clones as a template, in order to
confirm that the resistance of each of the cell clones to G418 was not due to
the incorporation of the neomycin resistance gene into the genome. The cell
clones used herein were cell clones Nos. 1 to 8 obtained by retransfection of
rSGREP-JFH1-derived replicated replicon RNA (rSGREP-JFH1-derived cell
clones Nos. 1 to 8), and cell clones Nos. 1 to 6 obtained by retransfection of
rSGREP-JCH1-derived replicated replicon RNA (rSGREP-JCH1-derived cell
clones Nos. 1 to 6). As a result, as shown in Fig. 9, none of the eight
examined rSGREP-JFH1-derived cell clones was positive for amplification of
the neomycin resistance gene. For the rSGREP-JCH1-derived cell clones, only
1 of the 6 examined clones was positive (in Fig. 9, lane 3 in the right
photograph). It was considered that this positive clone had acquired G418
resistance by the incorporation of the neomycin resistance gene in
rSGREP-JCH1-derived replicated replicon RNA into the genomic DNA of the host
cells. Thus, it was thought that in the positive clone, unlike the other
clones, the replicon RNA itself did not autonomously replicate within the
cells. This was confirmed by the results of the experiment shown in (H) below,
in which no HCV proteins were detected from the positive clone.
[0090] 

(H) Detection of HCV protein

Protein was extracted from rSGREP-JFH1- and rSGREP-JCH1-transfected
cell clones by a standard procedure, and then analyzed by SDS-PAGE and the
Western blot method (Fig. 10). The examined cell clones were the same as those
used in (G) above: rSGREP-JFH1-derived cell clones Nos. 1 to 8 and
rSGREP-JCH1-derived cell clones Nos. 1 to 6. In addition, a cellular extract
from cells obtained by transiently transfecting expression plasmid DNA
containing the NS3 gene into Huh7 cells was used as a positive control (NS3
protein). Furthermore, a

JP Application No. 2003-148242 



protein extracted from the untransfected Huh7 cells was used as a negative
control. A protein sample extracted from each cell clone was blotted onto a
PVDF membrane (Immobilon-P, Millipore), and then detection of NS3 protein
encoded by replicated replicon RNA was performed using an anti-NS3-specific
antibody (provided by Dr. Moradpour; Wölk B, et al., J. Virology. 2000, 74:
2293-2304). As shown in Fig. 10, in rSGREP-JFH1-derived cell clones Nos. 1 to
8 and rSGREP-JCH1-derived cell clones Nos. 1, 2 and 4 to 6, proteins of the
same size as that of the positive control were detected. In
rSGREP-JCH1-derived cell clone No. 3 (the clone detected as a positive clone
in (G) above), no expression of NS3 protein was detected. That is, in
rSGREP-JCH1-derived cell clone No. 3, no replication of replicon RNA was
confirmed. NS3 protein was not detected in the untransfected Huh7 cells,
revealing that in the cell clones wherein NS3 protein was detected, the
transfected replicon RNA autonomously replicated so that NS3 protein was
expressed.
[0091] 

Moreover, by the use of the serum of a hepatitis C patient as an antibody,
the expression of NS5A protein from the replicon RNA was also confirmed in each
cell clone for which the expression of NS3 protein had been confirmed as
described above.
[0092] 

Based on the results of (G) and (H) above, it was confirmed that replicon 
RNAs were replicated in the cell clones established by transfection of the replicon 
RNA. 

[0093] 
[Example 5] 

(I) Analysis of adaptive mutation

According to the description in Example 3, total RNA obtained from the
replicon-replicating cell clones established through the transfection of
rSGREP-JFH1 into Huh7 cells was retransfected into other Huh7 cells, thereby
establishing 21 cell clones. Total RNA was extracted from each of these cell
clones by a standard procedure. cDNA corresponding to the replicon RNA was
synthesized using the total RNA as a template, reverse transcriptase
Superscript II (Invitrogen) and primer 9641R-IH
(5'-GCACTCTCTGCAGTCATGCGGCTCACGGAC-3' (SEQ ID NO: 18)). The
composition of the reaction solution for the synthesis of cDNA by reverse
transcription is shown below.
[0094] 

Composition of Reaction Solution      Fluid Volume (µl)
5x 1st strand Buffer                  4
2 mM dNTP                             5
0.1 M DTT                             1
9651R-IH primer (100 µM)              1
DW (distilled water)                  6.5
Sample RNA (2 mg/mL)                  1
RNasin (Promega) (40 U/µl)            0.5
Superscript II RT (Invitrogen)        1
Total                                 20 µl
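
As a cross-check of the composition above, the component volumes can be summed
and the final concentrations computed from the stock concentrations. The
following is a minimal sketch; the dictionary layout and function names are
illustrative, and components whose final concentration is not computed here
(primer, water, enzyme) carry a stock value of None.

```python
# name -> (stock concentration, unit, volume in microliters)
RT_MIX = {
    "1st strand buffer": (5.0, "x", 4.0),
    "dNTP": (2.0, "mM", 5.0),
    "DTT": (100.0, "mM", 1.0),            # 0.1 M stock
    "primer": (None, "", 1.0),
    "distilled water": (None, "", 6.5),
    "sample RNA": (2.0, "mg/mL", 1.0),
    "RNasin": (40.0, "U/uL", 0.5),
    "reverse transcriptase": (None, "", 1.0),
}

def total_volume(mix):
    """Sum the component volumes in microliters."""
    return sum(volume for _, _, volume in mix.values())

def final_concentrations(mix):
    """Dilute each stock into the total volume: C_final = C_stock * V / V_total."""
    total = total_volume(mix)
    return {
        name: (stock * volume / total, unit)
        for name, (stock, unit, volume) in mix.items()
        if stock is not None
    }

print("total volume:", total_volume(RT_MIX))
for name, (conc, unit) in final_concentrations(RT_MIX).items():
    print(f"{name}: {conc:g} {unit}")
```

The volumes add up to the stated total of 20 µl, and the buffer is diluted to
1x, which is consistent with the composition as listed.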

[0095] 

In the cDNA synthesis reaction, the above reagents other than RNasin and
Superscript II were mixed to prepare a first reaction solution. The solution was
heated at 90°C for 3 minutes, and then cooled on ice. Subsequently, RNasin and
Superscript II were added to this reaction solution, and the solution was
allowed to react at 42°C for 1 hour, followed by another reaction at 70°C for 15
minutes.
[0096] 

Furthermore, PCR amplification was performed using the thus obtained
cDNA together with five primer sets by the following procedures, so that DNA
amplification fragments covering almost all the regions of the replicon RNA were
obtained. The primer sets used and the regions amplified by each primer set are
shown in Table 1 and Table 2 below.

[0097] 
[Table 1] 



Designation of        Primer set                  Amplified region
amplified fragment    Primer 1      Primer 2

A                     42S-IH        433R-neo      41 - 470
B                     C/S17ssp      4680R-IH      28 - 3026
C                     4534S-IH      7279R-IH      2880 - 5625
D                     7198S-IH      9367R-IH      5544 - 7713
E                     9247S-NF      9576R-NF      7597 - 7960

[0098] 

In Table 1, an amplified region is represented by nucleotide numbers in 
rSGREP-JFHl (SEQ ID NO: 1) that the region corresponds to. 
[0099] 
[Table 2] 



Primer designation   Nucleotide sequence (5'->3')       SEQ ID NO:

42S-IH               CCCCTGTGAGGAACTACTGTCTTCACGC       SEQ ID NO: 19
C/S17ssp             CCGGGAGAGCCATAGTGGTCTGCG           SEQ ID NO: 20
4534S-IH             CCACTCAAAGAAAAAGTGTGACGAGCTCGC     SEQ ID NO: 21
7198S-IH             GGCTTGGGCACGGCCTGA                 SEQ ID NO: 22
9247S-NF             GCGGTGAAGACCAAGCTCAAACTCACTCCA     SEQ ID NO: 23
433R-neo             AGAACCTGCGTGCAATCCATC              SEQ ID NO: 24
4680R-IH             CCCGTCATGAGGGCGTCGGTGGC            SEQ ID NO: 25
7279R-IH             ACCAGCAACGGTGGGCGGTTGGTAATC        SEQ ID NO: 26
9367R-IH             GGCACGCGACACGCTGTG                 SEQ ID NO: 27
9576R-NF             AGCTAGCCGTGACTAGGGCTAAGATGGAGC     SEQ ID NO: 28
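
The relation between a reverse primer in Table 2 and the end of an amplified
region in Table 1 can be illustrated by locating the reverse complement of the
primer on the replicon sequence. The following is only a sketch: the excerpt is
nucleotides 3001 to 3030 as read from the listing of SEQ ID NO: 1, and the
function names are illustrative.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))

def binding_site(primer, template_rna, template_start):
    """Locate where a reverse primer anneals on the sense-strand RNA.

    template_start is the nucleotide number of the first base of template_rna.
    Returns 1-based (first, last) nucleotide numbers, or None if not found.
    """
    target = reverse_complement(primer)                 # sense-strand equivalent
    template = template_rna.upper().replace("U", "T")   # compare as DNA letters
    offset = template.find(target)
    if offset < 0:
        return None
    first = template_start + offset
    return first, first + len(target) - 1

# Nucleotides 3001-3030 of SEQ ID NO: 1, copied from the sequence listing.
excerpt = "gucgccaccgacgcccucaugacgggguac"
primer_4680R_IH = "CCCGTCATGAGGGCGTCGGTGGC"            # SEQ ID NO: 25

print(binding_site(primer_4680R_IH, excerpt, 3001))    # (3004, 3026)
```

The last nucleotide of the binding site, 3026, agrees with the end of the
region given for fragment B in Table 1.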



[0100] 

The composition of the reaction solution for this PCR is as follows.

[0101]
Composition of Reaction Solution      Fluid Volume (µl)
Primer 1 (10 µM)                      1.0
Primer 2 (10 µM)                      1.0
2.5 mM dNTPs                          5.0
10x LA Buffer                         5.0
MgCl2 (25 mM)                         5.0
LA Taq (TAKARA) (5 U/µl)              0.3
DW (distilled water)                  30.7
Template cDNA                         2.0
Total                                 50 µl

[0102] 

In addition, the PCR conditions are as follows: 95°C for 2 minutes;
35 cycles of 98°C for 10 seconds and then 68°C for 8 minutes; and 72°C for 7
minutes; after which the temperature is kept at 4°C.
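
For planning purposes, the total hold time of this thermal program can be
estimated from the stated steps. The following sketch is illustrative only; it
ignores the ramp times between temperatures, which depend on the instrument,
and the final hold at 4°C.

```python
# Each step is (temperature in degrees C, duration in seconds).
INITIAL_DENATURATION = [(95, 2 * 60)]
CYCLE = [(98, 10), (68, 8 * 60)]
FINAL_EXTENSION = [(72, 7 * 60)]
CYCLES = 35

def hold_time_seconds(steps):
    """Sum the durations of a list of (temperature, seconds) steps."""
    return sum(seconds for _, seconds in steps)

total_seconds = (hold_time_seconds(INITIAL_DENATURATION)
                 + CYCLES * hold_time_seconds(CYCLE)
                 + hold_time_seconds(FINAL_EXTENSION))

print(f"{total_seconds / 3600:.1f} hours")
```

With an extension step of 8 minutes per cycle, the program takes roughly five
hours before ramp times are added.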
[0103] 

The nucleotide sequence of each PCR product obtained as described above
was determined, and then the RNA sequence corresponding to the DNA sequence
was compared with the sequence of rSGREP-JFH1. The results are shown in
Table 3.
[0104] 
[Table 3] 



Region    Synonymous      Nonsynonymous    Total number of
          substitution    substitution     mutations

NS3       0               5                5
NS4A      0               2                2
NS4B      0               3                3
NS5A      0               7                7
NS5B      3               5                8
Total     3               22               25



[0105] 
As shown in Table 3, the total number of nucleotide mutations observed in the
21 cell clones was 25. Of these mutations, 22 were nonsynonymous substitutions
inducing amino acid mutations. The types of these mutations are shown in
Table 4. In addition, the positions of these mutations in the non-structural
region are shown in Fig. 11.
[0106] 
[Table 4] 



Clone                            Mutation site
designation  Nucleotide No.  Nucleotide mutation  Amino acid mutation  Amino acid No.

C1           7098            A => G               None
             7157            A => G               Y => C               2824
C2           4955            C => U               A => V               2090
C3           4936            A => G               T => A               2084
             5000            A => G               Y => C               2105
             7287            A => G               None
             7288            A => G               M => V               2868
C4           5901            G => U               E => D               2405
             6113            A => U               H => L               2476
C5           2890            A => G               K => E               1402
C6           7209            A => G               None

[0107] 

In Table 4 and Fig. 11, "C1 to C6" represent replicon-replicating cell
clones C1 to C6 having replicon RNA found to have mutations. "Nucleotide No."
shows the corresponding nucleotide numbers within the nucleotide sequence of
replicon RNA rSGREP-JFH1 (SEQ ID NO: 1). "Amino acid No." shows the
corresponding amino acid numbers within the amino acid sequence encoded by the
JFH-1 clone (SEQ ID NO: 4). The types of nucleotides and amino acids at
mutation sites are described according to their general notations. As shown in
Table 4, in clone C2, the nucleotide corresponding to nucleotide No. 4955 of
SEQ ID NO: 1 on the replicon RNA mutated from C (cytosine) to U (uracil), which
results in a mutation of the amino acid corresponding to amino acid No. 2090 of
SEQ ID NO: 4 from A (alanine) to V (valine).
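
The correspondence between nucleotide numbers and amino acid numbers in Table 4
can be checked with a short calculation. The following is only a sketch. It
assumes that the codon for amino acid No. 2090 occupies nucleotides 4954 to
4956 of SEQ ID NO: 1; this anchor is not stated in the description and was
inferred here from the entries of Table 4. The codon GCG is read from those
positions in the sequence listing, and the function names are illustrative.

```python
# Assumption (inferred from Table 4, not stated in the text): the codon for
# amino acid No. 2090 occupies nucleotides 4954-4956 of SEQ ID NO: 1.
ANCHOR_NT = 4954
ANCHOR_AA = 2090

def amino_acid_number(nucleotide_no):
    """Map a replicon nucleotide number to a polyprotein amino acid number."""
    return ANCHOR_AA + (nucleotide_no - ANCHOR_NT) // 3

def codon_position(nucleotide_no):
    """Return 1, 2 or 3: the position of the nucleotide within its codon."""
    return (nucleotide_no - ANCHOR_NT) % 3 + 1

# Nonsynonymous entries of Table 4: nucleotide No. -> amino acid No.
TABLE_4 = {7157: 2824, 4955: 2090, 4936: 2084, 5000: 2105,
           7288: 2868, 5901: 2405, 6113: 2476, 2890: 1402}

for nt, aa in TABLE_4.items():
    print(nt, amino_acid_number(nt), amino_acid_number(nt) == aa)

# Only the two codons needed for the clone C2 example are listed here.
CODON_TO_AA = {"GCG": "A", "GUG": "V"}

def mutate(codon, position, new_base):
    """Replace the base at a 1-based position within a codon."""
    return codon[:position - 1] + new_base + codon[position:]

wild_type = "GCG"                                   # nucleotides 4954-4956
mutant = mutate(wild_type, codon_position(4955), "U")
print(CODON_TO_AA[wild_type], "=>", CODON_TO_AA[mutant])
```

Under this assumption every nonsynonymous entry of Table 4 maps to its listed
amino acid number, and the C to U change at nucleotide No. 4955 falls on the
second base of the codon, converting GCG (alanine) to GUG (valine).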
[0108] 

Furthermore, mutation positions shown in Fig. 11 are shown with bar lines 
with the nucleotide numbers shown in Table 4. A thick bar line represents 
nonsynonymous substitution, and a thin bar line represents synonymous 
substitution. 
[0109] 

There were 2 clones having no nucleotide mutations at all that cause amino 
acid mutations. When Northern blot analysis was conducted for the 2 clones, it 
was shown that in these 2 clones, the quantity of replicon RNAs replicated was 
lower than those in the cell clones that had replicated replicon RNAs having a 
nucleotide mutation that causes an amino acid mutation. Hence, it was 
considered that the nucleotide mutation causing an amino acid mutation within the 
replicon RNA was an adaptive mutation for increasing the replication efficiency of 
the replicon RNA in Huh7 cells. 
[0110] 

[Effects of Invention] 

According to the present invention, an HCV-RNA replicon derived from 
the genotype 2a strain of HCV was obtained for the first time. The replicon- 
replicating cell according to the present invention can be used as a culture system 
for the continuous production of RNA and HCV proteins derived from HCV of 
genotype 2a. Furthermore, the replicon-replicating cell according to the present 
invention is useful as a test system for screening for various substances that affect 
HCV replication and/or the translation of HCV proteins. 
[0111] 
[Sequence Listing]

SEQUENCE LISTING
<110> Toray Industries Inc.
      Tokyo Metropolitan Organization for Medical Research
      Johannes Gutenberg-Universitaet Mainz

<120> Establishment of the genotype 2a Hepatitis C virus subgenomic
<130> P03-0429

<160> 28

<170> PatentIn Ver. 2.1

<210> 1
<211> 8024
<212> RNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: replicon
<220>
<223> Inventor: Wakita, Takaji
      Inventor: Kato, Takanobu
      Inventor: Date, Tomoko

<400> 1 
accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 18 0 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 3 00 
cgcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa auccuaaacc 360
ucaaagaaaa accaaaagaa acaccaaccg ucgcccaaug auugaacaag auggauugca 420 
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 54 0 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 66 0 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 72 0 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 84 0 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200 
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 13 2 0 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 13 80 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 14 4 0 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 162 0 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 174 0 
acgugguuuu ccuuugaaaa acacgaugau accauggcuc ccaucacugc uuaugcccag 1800 
caaacacgag gccuccuggg cgccauagug gugaguauga cggggcguga caggacagaa 1860 
caggccgggg aaguccaaau ccuguccaca gucucucagu ccuuccucgg aacaaccauc 1920 
ucggggguuu uguggacugu uuaccacgga gcuggcaaca agacucuagc cggcuuacgg 1980
gguccgguca cgcagaugua cucgagugcu gagggggacu ugguaggcug gcccagcccc 2040 
ccugggacca agucuuugga gccgugcaag uguggagccg ucgaccuaua ucuggucacg 2100
cggaacgcug augucauccc ggcucggaga cgcggggaca agcggggagc auugcucucc 2160 
ccgagaccca uuucgaccuu gaaggggucc ucgggggggc cggugcucug cccuaggggc 2220 
cacgucguug ggcucuuccg agcagcugug ugcucucggg gcguggccaa auccaucgau 2280 
uucauccccg uugagacacu cgacguuguu acaaggucuc ccacuuucag ugacaacagc 23 4 0 
acgccaccgg cugugcccca gaccuaucag gucggguacu ugcaugcucc aacuggcagu 2400 
ggaaagagca ccaagguccc ugucgcguau gccgcccagg gguacaaagu acuagugcuu 24 60 
aaccccucgg uagcugccac ccugggguuu ggggcguacc uauccaaggc acauggcauc 2520
aaucccaaca uuaggacugg agucaggacc gugaugaccg gggaggccau cacguacucc 2580 
acauauggca aauuucucgc cgaugggggc ugcgcuagcg gcgccuauga caucaucaua 2640 
ugcgaugaau gccacgcugu ggaugcuacc uccauucucg gcaucggaac gguccuugau 2700 
caagcagaga cagccggggu cagacuaacu gugcuggcua cggccacacc ccccggguca 2760 
gugacaaccc cccaucccga uauagaagag guaggccucg ggcgggaggg ugagaucccc 282 0 
uucuauggga gggcgauucc ccuauccugc aucaagggag ggagacaccu gauuuucugc 2880 
cacucaaaga aaaaguguga cgagcucgcg gcggcccuuc ggggcauggg cuugaaugcc 294 0 
guggcauacu auagaggguu ggacgucucc auaauaccag cucagggaga uguggugguc 3000 
gucgccaccg acgcccucau gacgggguac acuggagacu uugacuccgu gaucgacugc 3 060 
aauguagcgg ucacccaagc ugucgacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cacaagacgc ugucucacgc agucagcgcc gcgggcgcac agguagagga 3180 
agacagggca cuuauaggua uguuuccacu ggugaacgag ccucaggaau guuugacagu 3240 
guagugcuuu gugagugcua cgacgcaggg gcugcguggu acgaucucac accagcggag 33 00 
accaccguca ggcuuagagc guauuucaac acgcccggcc uacccgugug ucaagaccau 3360 
cuugaauuuu gggaggcagu uuucaccggc cucacacaca uagacgccca cuuccucucc 3420 
caaacaaagc aagcggggga gaacuucgcg uaccuaguag ccuaccaagc uacggugugc 34 80 
gccagagcca aggccccucc cccguccugg gacgccaugu ggaagugccu ggcccgacuc 3540 
aagccuacgc uugcgggccc cacaccucuc cuguaccguu ugggcccuau uaccaaugag 3 600 
gucacccuca cacacccugg gacgaaguac aucgccacau gcaugcaagc ugaccuugag 3660 
gucaugacca gcacgugggu ccuagcugga ggaguccugg cagccgucgc cgcauauugc 3 720 
cuggcgacug gaugcguuuc caucaucggc cgcuugcacg ucaaccagcg agucgucguu 378 0 
gcgccggaua aggagguccu guaugaggcu uuugaugaga uggaggaaug cgccucuagg 3 84 0 
gcggcucuca ucgaagaggg gcagcggaua gccgagaugu ugaaguccaa gauccaaggc 3900
uugcugcagc aggccucuaa gcaggcccag gacauacaac ccgcuaugca ggcuucaugg 3960 
cccaaagugg aacaauuuug ggccagacac auguggaacu ucauuagcgg cauccaauac 4020
cucgcaggau ugucaacacu gccagggaac cccgcggugg cuuccaugau ggcauucagu 4080 
gccgcccuca ccaguccguu gucgaccagu accaccaucc uucucaacau caugggaggc 414 0 
ugguuagcgu cccagaucgc accacccgcg ggggccaccg gcuuugucgu caguggccug 4200 
gugggggcug ccgugggcag cauaggccug gguaaggugc ugguggacau ccuggcagga 4260 
uauggugcgg gcauuucggg ggcccucguc gcauucaaga ucaugucugg cgagaagccc 4320 
ucuauggaag augucaucaa ucuacugccu gggauccugu cuccgggagc ccugguggug 43 80 
ggggucaucu gcgcggccau ucugcgccgc cacgugggac cgggggaggg cgcgguccaa 444 0 
uggaugaaca ggcuuauugc cuuugcuucc agaggaaacc acgucgcccc uacucacuac 4 5 00 
gugacggagu cggaugcguc gcagcgugug acccaacuac uuggcucucu uacuauaacc 4 5 60 
agccuacuca gaagacucca caauuggaua acugaggacu gccccauccc augcuccgga 4620
uccuggcucc gcgacgugug ggacuggguu ugcaccaucu ugacagacuu caaaaauugg 4680 
cugaccucua aauuguuccc caagcugccc ggccuccccu ucaucucuug ucaaaagggg 4740 
uacaagggug ugugggccgg cacuggcauc augaccacgc gcugcccuug cggcgccaac 4800
aucucuggca auguccgccu gggcucuaug aggaucacag ggccuaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugc uacacggagg gccagugcgc gccgaaaccc 4 920 
cccacgaacu acaagaccgc caucuggagg guggcggccu cggaguacgc ggaggugacg 4 980 
cagcaugggu cguacuccua uguaacagga cugaccacug acaaucugaa aauuccuugc 5040 
ccagaguu uuucuccugg guggacggug ugcagaucca uagguuugca 5100 
. agccguuuuu ccgggaugag gucucguucu gcguugggcu uaauuccuau 5160 
gcugucgggu cccagcuucc cugugaaccu gagcccgacg cagacguauu gagguccaug 5220 
cuaacagauc cgccccacau cacggcggag acugcggcgc ggcgcuuggc acggggauca 5280 
ccuccaucug aggcgagcuc cucagugagc cagcuaucag caccgucgcu gcgggccacc 5340 
ugcaccaccc acagcaacac cuaugacgug gacauggucg augccaaccu gcucauggag 54 00 
ggcggugugg cucagacaga gccugagucc agggugcccg uucuggacuu ucucgagcca 54 6 0 
auggccgagg aagagagcga ccuugagccc ucaauaccau cggagugcau gcuccccagg 5520 
agcggguuuc cacgggccuu accggcuugg gcacggccug acuacaaccc gccgcucgug 5580 
gaaucgugga ggaggccaga uuaccaaccg cccaccguug cugguugugc ucuccccccc 5640 
cccaagaagg ccccgacgcc ucccccaagg agacgccgga cagugggucu gagcgagagc 5700
accauaucag aagcccucca gcaacuggcc aucaagaccu uuggccagcc ccccucgagc 5760 
ggugaugcag gcucguccac gggggcgggc gccgccgaau ccggcggucc gacguccccu 5820
ggugagccgg cccccucaga gacagguucc gccuccucua ugcccccccu cgagggggag 5880 
ccuggagauc cggaccugga gucugaucag guagagcuuc aaccuccccc ccaggggggg 594 0 
gggguagcuc ccgguucggg cucggggucu uggucuacuu gcuccgagga ggacgauacc 60 0 0 
accgugugcu gcuccauguc auacuccugg accggggcuc uaauaacucc cuguagcccc 6060 
gaagaggaaa aguugccaau caacccuuug aguaacucgc uguugcgaua ccauaacaag 612 0 
guguacugua caacaucaaa gagcgccuca cagagggcua aaaagguaac uuuugacagg 6180
acgcaagugc ucgacgccca uuaugacuca gucuuaaagg acaucaagcu agcggcuucc 6240
aaggucagcg caaggcuccu caccuuggag gaggcgugcc aguugacucc accccauucu 63 00 
gcaagaucca aguauggauu cggggccaag gagguccgca gcuuguccgg gagggccguu 63 60 
aaccacauca aguccgugug gaaggaccuc cuggaagacc cacaaacacc aauucccaca 6420 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaagaaacca 6480 
gcucgccuca ucguuuaccc ugaccucggc guccgggucu gcgagaaaau ggcccucuau 6540
gacauuacac aaaagcuucc ucaggcggua augggagcuu ccuauggcuu ccaguacucc 6600 
ccugcccaac ggguggagua ucucuugaaa gcaugggcgg aaaagaagga ccccaugggu 6660 
uuuucguaug auacccgaug cuucgacuca accgucacug agagagacau caggaccgag 6720
gaguccauau accaggccug cucccugacc gaggaggccc gcacugccau acacucgcug 6780
acugagagac uuuacguagg agggcccaug uucaacagca agggucaaac cugcgguuac 684 0 
agacguugcc gcgccagcgg ggugcuaacc acuagcaugg guaacaccau cacaugcuau 6900 
gugaaagccc uagcggccug caaggcugcg gggauaguug cgcccacaau gcugguaugc 69 6 0 
ggcgaugacc uaguagucau cucagaaagc caggggacug aggaggacga gcggaaccug 7020 
agagccuuca cggaggccau gaccagguac ucugccccuc cuggugaucc ccccagaccg 7080 
gaauaugacc uggagcuaau aacauccugu uccucaaaug ugucuguggc guugggcccg 7140 
cggggccgcc gcagauacua ccugaccaga gacccaacca cuccacucgc ccgggcugcc 7200 
ugggaaacag uuagacacuc cccuaucaau ucauggcugg gaaacaucau ccaguaugcu 7260 
ccaaccauau ggguucgcau gguccuaaug acacacuucu ucuccauucu caugguccaa 7320
gacacccugg accagaaccu caacuuugag auguauggau caguauacuc cgugaauccu 73 80 
uuggaccuuc cagccauaau ugagagguua cacgggcuug acgccuuuuc uaugcacaca 74 4 0 
uacucucacc acgaacugac gcggguggcu ucagcccuca gaaaacuugg ggcgccaccc 7500 
cucagggugu ggaagagucg ggcucgcgca gucagggcgu cccucaucuc ccguggaggg 7560 
aaagcggccg uuugcggccg auaucucuuc aauugggcgg ugaagaccaa gcucaaacuc 7620 
acuccauugc cggaggcgcg ccuacuggac uuauccaguu gguucaccgu cggcgccggc 7680 
gggggcgaca uuuuucacag cgugucgcgc gcccgacccc gcucauuacu cuucggccua 7740
cuccuacuuu ucguaggggu aggccucuuc cuacuccccg cucgguagag cggcacacac 7800
uagguacacu ccauagcuaa cuguuccuuu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu 7860
uuuuuuuuuu cuuuuuuuuu uuuuucccuc uuucuucccu ucucaucuua uucuacuuuc 7920
uuucuuggug gcuccaucuu agcccuaguc acggcuagcu gugaaagguc cgugagccgc 7980
augacugcag agagugccgu aacuggucuc ucugcagauc augu 8024



<210> 2 
<211> 8024 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
<400> 2 

acccgccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgucua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 24 0 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 3 00 
ugcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa aucccaaacc 3 60 
ucaaagaaaa accaaaagaa acacuaaccg ucgcccaaug auugaacaag auggauugca 42 0 
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 48 0 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 540
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 84 0 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 90 0 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 10 2 0 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 114 0 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 144 0 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 162 0 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgauaau accauggccc ccaucaccgc uuacgcccag 1800 
cagacacgag gucucuuggg cucuauagug gugagcauga cggggcguga caagacagaa 1860 
caggccgggg agguccaagu ccuguccaca gucacucagu ccuuccucgg aacauccauu 1920
ucgggggucu uauggacugu uuaccacgga gcuggcaaca agacacuagc cggcucgcgg 1980 
ggcccgguca cgcagaugua cucgagcgcc gagggggacu uggucgggug gcccagcccu 2040 
ccugggacca aaucuuugga gccguguacg uguggagcgg ucgaccugua uuuggucacg 2100 
cggaacgcug augucauccc ggcucgaaga cgcggggaca agcggggagc gcugcucucc 2160 
ccgagacccc uuucgaccuu gaaggggucc ucggggggac cugugcuuug cccuaggggc 2220 
cacgcugucg gaaucuuccg ggcagcugug ugcucucggg guguggcuaa guccauagau 2280
uucauccccg uugagacgcu cgacaucguc acgcggucuc ccaccuuuag ugacaacagc 2340
acaccaccag cugugcccca gaccuaucag gugggguacu ugcacgcccc cacuggcagu 2400 
ggaaaaagca ccaagguccc cgucgcguac gccgcccagg gguauaaagu gcuggugcuc 2460
aaucccucgg uggcugccac ccugggauuu ggggcguacu uguccaaggc acauggcauc 252 0 
aaccccaaca uuaggacugg agucagaacu gugacgaccg gggagcccau uacauacucc 258 0 
acguauggua aauuccucgc cgaugggggc ugcgcaggcg gcgccuauga caucaucaua 2 64 0 
ugcgaugaau gccacucugu ggaugcuacc acuauucucg gcaucgggac aguccuugac 2700 
caagcagaga cagccggggu caggcuaacu guacuggcca cggccacgcc ccccgggucg 2760 
gugacaaccc cccaucccaa uauagaggag guagcccucg gacaggaggg ugagaucccc 2820 
uucuauggga gggcguuucc ccugucuuac aucaagggag ggaggcacuu gauuuucugc 2880
cacucaaaga aaaaguguga cgagcucgca acggcccuuc ggggcauggg cuugaacgcu 2940 
guggcauauu acagaggguu ggacgucucc auaauaccaa cucaaggaga uguggugguc 30 0 0 
guugccaccg acgcccucau gacgggguau acuggagacu uugacuccgu gaucgacugc 3 060 
aacguagcgg ucacccaggc cguagacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cgcaagacgc ugucucacgu agucagcgcc gagggcgcac ggguagagga 318 0 
agacugggca uuuauaggua uguuuccacu ggugagcgag ccucaggaau guuugacagu 3240 
guaguacucu gugagugcua cgacgcagga gcugcuuggu augagcucuc accaguggag 3 3 00 
acgaccguca ggcucagggc guauuucaac acgccuggcu ugccugugug ccaggaccac 33 60 
cuugaguuuu gggaggcagu uuucaccggc cucacacaca uagacgcuca uuuccuuucc 3420 
cagacaaagc agucggggga aaauuucgca uacuuaguag ccuaucaggc cacagugugc 3480
gccagggcca aagcgccccc cccguccugg gacgucaugu ggaagugcuu gacucgacuc 3540
aagcccacgc uugugggccc uacaccucuc cuguaccguu ugggcucugu uaccaacgag 3600
gucacccuua cacaccccgu gacaaaauac aucgccacau gcaugcaagc ugaccucgag 3660
gucaugacca gcacgugggu ccuggcuggg ggagucuuag cagccgucgc cgcguauugc 3720 
uuagcgaccg gguguguuuc caucauuggc cguuuacaca ucaaccagcg agcugucguc 3780 
gcuccggaca aggagguccu cuaugaggcu uuugaugaga uggaggaaug ugccuccaga 3840 
gcggcucucc uugaagaggg gcagcggaua gccgagaugc ugaaguccaa gauccaaggc 3900
uuauugcagc aagccucuaa acaggcccag gacauacaac ccgcugugca agcuucgugg 3960
cccaagaugg agcaauucug ggccaaacau auguggaacu ucauaagcgg cauucaguac 4020
cucgcaggac ugucaacacu gccagggaac ccugcugugg cuuccaugau ggcauucagc 4080
gccgcccuca ccaguccguu gucaacuagc accaccaucc uucuuaacau ucuggggggc 4140 
uggcuggcgu cccaaauugc gccacccgcg ggggccacug gcuuuguugu caguggccug 4200
gugggagcug cuguuggcag cauaggcuug gguaaagugc ugguggacau ccuggcaggg 4260
uauggugcgg gcauuucggg ggccqucguc gcguuuaaga ucaugucugg cgagaagccc 4320 
uccauggagg augucaucaa cuugcugccu gggauucugu cuccaggugc ucugguggug 4380 
ggagucaucu gcgcggccau ucugcgccgc caugugggac cgggggaagg cgcgguccaa 4440
uggaugaaca ggcuuaucgc cuucgcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gugacggagu cggaugcguc gcagcguguc acccaacugc uuggcucucu cacuauaacu 4560 
agucuacuca ggagacuuca caacuggauc acugaggauu gccccauccc augcgccggc 4620
ucguggcucc gcgaugugug ggacuggguc uguaccaucc uaacagacuu uaagaacugg 4680 
cugaccucca agcuguuccc aaagaugccu ggccuccccu uuaucucuug ccaaaagggg 4740 
uacaagggcg ugugggccgg cacuggcauc augaccacac gaugccccug cggcgccaac 4800
aucucuggca acguccgcuu gggcucuaug agaaucacag gacccaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugu uauacagaag gccagugcuu gccgaaaccc 4920 
gcguuaaacu ucaagaccgc caucuggaga guggcggccu cagaguacgc ggaagugacg 4980 
cagcacggau cauaugccua uauaacaggg cugaccacug acaacuuaaa agucccuugc 5040
caacuccccu cuccagaguu uuucucuugg guggacggag uacaaaucca uagguccgcc 5100 
cccacaccaa agccguuuuu ccgggaugag gucucguuca gcguugggcu caauucauuu 5160 
gucgucgggu cucagcuucc cugugacccu gagcccgaca cugagguagu gauguccaug 5220
cuaacagacc caucccauau cacggcggag gcugcagcgc ggcguuuagc gcggggguca 5280 
cccccaucug aggcaagcuc cucagcgagc cagcugucgg cgccaucgcu gcgagccacc 5340 
ugcaccaccc acgguaggac cuaugaugug gacauggugg augccaaccu guucaugggg 5400
ggcggcguga uucggauaga gucugagucc aaaguggucg uucuggacuc ccucgacuca 5460 
augaccgagg aagagggcga ccuugagccu ucaguaccau cggaguauau gcuccccagg 5520 
aagagguucc caccggccuu accggcuugg gcgcggccug auuacaaccc accgcuugug 5580 
gaaucgugga agaggccaga uuaccaacca cccacuguug cgggcugugc ucuccccccc 5640
cccaaaaaga ccccgacgcc uccuccaagg agacgccgga cagugggucu gagcgagagc 5700 
accauaggag augcccucca acagcuggcc aucaaguccu uuggccagcc ccccccaagc 5760
ggcgauucag gccuuuccac gggggcggac gccgccgacu ccggcgaucg gacacccccu 5820
gacgaguugg cucuuucgga gacagguucu accuccucca ugcccccccu cgagggggag 5880 
ccuggggacc cagaccugga gccugagcag guagagcuuc aaccuccucc ccaggggggg 5940
gaggcagcuc ccggcucgga cucggggucc uggucuacuu gcuccgagga ggaugacucc 6000
gucgugugcu gcuccauguc auauuccugg accggggcuc uaauaacucc uuguagcccc 6060 
gaagaggaaa aguugccaau uaacucauug agcaacucgc uguugcgaua ccauaacaag 6120 
guauacugua cuacaucaaa gagugccuca cuaagggcua aaaagguaac uuuugauagg 6180 
augcaagugc ucgacgccua uuaugauuca gucuuaaagg acaucaagcu agcggccucc 6240
aaggucagcg caaggcuccu caccuuagag gaggcgugcc aauugacccc accccacucu 6300
gcaagaucca aguauggguu uggggcuaag gagguccgca gcuuguccgg gagggccguc 6360
aaccacauca aguccgugug gaaggaccuc uuggaagacu cacaaacacc aauuccuaca 6420
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaaaaaacca 6480
gcucgccuua ucguuuaccc ugaccucggc gucagggucu gcgagaagau ggcccuuuau 6540 
gaugucacac aaaagcuucc ucaggcggug augggggcuu cuuauggcuu ccaguacucc 6600 
cccgcucagc ggguggaguu ucucuugaag gcaugggcgg aaaagagaga cccuaugggu 6660 
uuuucguaug auacccgaug cuuugacuca accgucacug agagagacau caggacugag 6720
gaguccauau accaggccug cuccuuaccc gaggaggccc gaacugccau acacucgcug 6780 
acugagagac ucuauguggg agggcccaug uucaacagca agggccaguc cugcggguac 6840 
aggcguugcc gcgccagcgg ggugcuuacc acuaguaugg ggaacaccau cacaugcuau 6900 
guaaaagccc uagcggcuug caaggcugcg gggauaauug cgcccacgau gcugguaugc 6960 
ggcgacgacu uggucgucau cucagaaagc caggggacug aggaggacga gcggaaccug 7020
agagccuuca cggaggcuau gaccagguau ucugccccuc cuggugaccc ccccagaccg 7080 
gaauaugacc uggagcuaau aacaucuugu uccucaaacg ugucuguggc acuuggccca 7140 
cagggccgcc gcagauacua ccugaccaga gaccccacca cuucaauugc ccgggcugcc 7200
ugggaaacag uuagacacuc cccugucaau ucauggcugg gaaacaucau ccaguacgcu 7260
ccaaccauau ggguucgcau gguccugaug acacacuucu ucuccauucu cauggcccag 7320
gacacccuag accagaaccu uaacuuugaa auguacggau cgguguacuc cgugaguccu 7380
cuggaccucc cagccauaau ugaaagguua cacgggcuug acgccuucuc ucugcacaca 7440
uacacucccc acgaacugac gcggguggcu ucagcccuca gaaaacuugg ggcgccaccc 7500 
cucagagcgu ggaagagucg ggcgcgugca guuagggcgu cccucaucuc ccgugggggg 7560 
agggcggccg uuugcggucg guaccucuuc aacugggcgg ugaagaccaa gcucaaacuc 7620
acuccuuugc cggaggcacg ccuccuggau uuguccaguu gguuuaccgu cggcgccggc 7680 
gggggcgaca uuuaucacag cgugucgcgu gcccgacccc gccuauuacu ccuuagccua 7740 
cuccuacuuu cuguaggggu aggccucuuc cuacuccccg cucgauagag cggcacacau 7800
uagcuacacu ccauagcuaa cuguuccuuu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu 7860
uuuuuuuuuu cuuuuuuuuu uuuuucccuc uuucuucccu ucucaucuua uucuacuuuc 7920
uuucuuggug gcuccaucuu agcccuaguc acggcuagcu gugaaagguc cgugagccgc 7980 
augacugcag agagugccgu aacuggucuc ucugcagauc augu 8024



<210> 3 
<211> 9678 
<212> DNA 

<213> Hepatitis C virus 

<220> 
<221> CDS 
<222> (341)..(9442)



<400> 3 

acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 



cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120



ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 



aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240



caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300 



cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atg agc aca aat cct 355
Met Ser Thr Asn Pro 



aaa cct caa aga aaa acc aaa aga aac acc aac cgt cgc cca gaa gac 403
Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Glu Asp

gtt aag ttc ccg ggc ggc ggc cag atc gtt ggc gga gta tac ttg ttg 451
Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr Leu Leu

ccg cgc agg ggc ccc agg ttg ggt gtg cgc acg aca agg aaa act tcg 499
Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr Thr Arg Lys Thr Ser

gag cgg tcc cag cca cgt ggg aga cgc cag ccc atc ccc aaa gat cgg 547
Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys Asp Arg

cgc tcc act ggc aag gcc tgg gga aaa cca ggt cgc ccc tgg ccc cta 595
Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly Arg Pro Trp Pro Leu
70 75 80 85 



tat ggg aat gag gga ctc ggc tgg gca gga tgg ctc ctg tcc ccc cga 643
Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg
90 95 100

ggc tct cgc ccc tcc tgg ggc ccc act gac ccc cgg cat agg tcg cgc 691
Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg His Arg Ser Arg
105 110 115

aac gtg ggt aaa gtc atc gac acc cta acg tgt ggc ttt gcc gac ctc 739
Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
120 125 130

atg ggg tac atc ccc gtc gta ggc gcc ccg ctt agt ggc gcc gcc aga 787
Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu Ser Gly Ala Ala Arg
135 140 145

gct gtc gcg cac ggc gtg aga gtc ctg gag gac ggg gtt aat tat gca 835
Ala Val Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
150 155 160 165

aca ggg aac cta ccc ggt ttc ccc ttt tct atc ttc ttg ctg gcc ctg 883
Thr Gly Asn Leu Pro Gly Phe Pro Phe Ser Ile Phe Leu Leu Ala Leu
170 175 180

ttg tcc tgc atc acc gtt ccg gtc tct gct gcc cag gtg aag aat acc 931
Leu Ser Cys Ile Thr Val Pro Val Ser Ala Ala Gln Val Lys Asn Thr
185 190 195

agt agc agc tac atg gtg acc aat gac tgc tcc aat gac agc atc act 979
Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Ser Asn Asp Ser Ile Thr
200 205 210 



tgg cag ctc gag gct gcg gtt ctc cac gtc ccc ggg tgc gtc ccg tgc 1027
Trp Gln Leu Glu Ala Ala Val Leu His Val Pro Gly Cys Val Pro Cys
215 220 225

gag aga gtg ggg aat acg tca cgg tgt tgg gtg cca gtc tcg cca aac 1075
Glu Arg Val Gly Asn Thr Ser Arg Cys Trp Val Pro Val Ser Pro Asn
230 235 240 245

atg gct gtg cgg cag ccc ggt gcc ctc acg cag ggt ctg cgg acg cac 1123
Met Ala Val Arg Gln Pro Gly Ala Leu Thr Gln Gly Leu Arg Thr His
250 255 260

atc gat atg gtt gtg atg tcc gcc acc ttc tgc tct gct ctc tac gtg 1171
Ile Asp Met Val Val Met Ser Ala Thr Phe Cys Ser Ala Leu Tyr Val
265 270 275

ggg gac ctc tgt ggc ggg gtg atg ctc gcg gcc cag gtg ttc atc gtc 1219
Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala Gln Val Phe Ile Val
280 285 290

tcg ccg cag tac cac tgg ttt gtg caa gaa tgc aat tgc tcc atc tac 1267
Ser Pro Gln Tyr His Trp Phe Val Gln Glu Cys Asn Cys Ser Ile Tyr
295 300 305

cct ggc acc atc act gga cac cgc atg gca tgg gac atg atg atg aac 1315
Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
310 315 320 325 






JP Application No. 2003-148242 



tgg tcg ccc acg gcc acc atg atc ctg gcg tac gtg atg cgc gtc ccc 1363
Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr Val Met Arg Val Pro
330 335 340

gag gtc atc ata gac atc gtt agc ggg gct cac tgg ggc gtc atg ttc 1411
Glu Val Ile Ile Asp Ile Val Ser Gly Ala His Trp Gly Val Met Phe
345 350 355

ggc ttg gcc tac ttc tct atg cag gga gcg tgg gcg aag gtc att gtc 1459
Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp Ala Lys Val Ile Val
360 365 370

atc ctt ctg ctg gcc gct ggg gtg gac gcg ggc acc acc acc gtt gga 1507
Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Gly Thr Thr Thr Val Gly
375 380 385

ggc gct gtt gca cgt tcc acc aac gtg att gcc ggc gtg ttc agc cat 1555
Gly Ala Val Ala Arg Ser Thr Asn Val Ile Ala Gly Val Phe Ser His
390 395 400 405

ggc cct cag cag aac att cag ctc att aac acc aac ggc agt tgg cac 1603
Gly Pro Gln Gln Asn Ile Gln Leu Ile Asn Thr Asn Gly Ser Trp His
410 415 420

atc aac cgt act gcc ttg aat tgc aat gac tcc ttg aac acc ggc ttt 1651
Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Asn Thr Gly Phe
425 430 435

ctc gcg gcc ttg ttc tac acc aac cgc ttt aac tcg tca ggg tgt cca 1699
Leu Ala Ala Leu Phe Tyr Thr Asn Arg Phe Asn Ser Ser Gly Cys Pro






ggg cgc ctg tcc gcc tgc cgc aac atc gag gct ttc cgg ata ggg tgg 1747
Gly Arg Leu Ser Ala Cys Arg Asn Ile Glu Ala Phe Arg Ile Gly Trp
455 460 465

ggc acc cta cag tac gag gat aat gtc acc aat cca gag gat atg agg 1795
Gly Thr Leu Gln Tyr Glu Asp Asn Val Thr Asn Pro Glu Asp Met Arg
470 475 480 485

ccg tac tgc tgg cac tac ccc cca aag ccg tgt ggc gta gtc ccc gcg 1843
Pro Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys Gly Val Val Pro Ala
490 495 500

agg tct gtg tgt ggc cca gtg tac tgt ttc acc ccc agc ccg gta gta 1891
Arg Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val
505 510 515

gtg ggc acg acc gac aga cgt gga gtg ccc acc tac aca tgg gga gag 1939
Val Gly Thr Thr Asp Arg Arg Gly Val Pro Thr Tyr Thr Trp Gly Glu
520 525 530

aat gag aca gat gtc ttc cta ctg aac agc acc cga ccg ccg cag ggc 1987
Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr Arg Pro Pro Gln Gly
535 540 545

tca tgg ttc ggc tgc acg tgg atg aac tcc act ggt ttc acc aag act 2035
Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr
550 555 560 565

tgt ggc gcg cca cct tgc cgc acc aga gct gac ttc aac gcc agc acg 2083
Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp Phe Asn Ala Ser Thr
570 575 580

gac ttg ttg tgc cct acg gat tgt ttt agg aag cat cct gat gcc act 2131
Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr
585 590 595 

tat att aag tgt ggt tct ggg ccc tgg ctc aca cca aag tgc ctg gtc 2179
Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Lys Cys Leu Val
600 605 610

cac tac cct tac aga ctc tgg cat tac ccc tgc aca gtc aat ttt acc 2227
His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr
615 620 625

atc ttc aag ata aga atg tat gta ggg ggg gtt gag cac agg ctc acg 2275
Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Thr
630 635 640 645

gcc gca tgc aac ttc act cgt ggg gat cgc tgc gac ttg gag gac agg 2323
Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys Asp Leu Glu Asp Arg
650 655 660

gac agg agt cag ctg tct cct ctg ttg cac tct acc acg gaa tgg gcc 2371
Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser Thr Thr Glu Trp Ala
665 670 675

atc ctg ccc tgc acc tac tca gac tta ccc gct ttg tca act ggt ctt 2419
Ile Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala Leu Ser Thr Gly Leu
680 685 690

ctc cac ctt cac cag aac atc gtg gac gta caa tac atg tat ggc ctc 2467
Leu His Leu His Gln Asn Ile Val Asp Val Gln Tyr Met Tyr Gly Leu
695 700 705

tca cct gct atc aca aaa tac gtc gtt cga tgg gag tgg gtg gta ctc 2515
Ser Pro Ala Ile Thr Lys Tyr Val Val Arg Trp Glu Trp Val Val Leu
710 715 720 725

tta ttc ctg ctc tta gcg gac gcc aga gtc tgc gcc tgc ttg tgg atg 2563
Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met
730 735 740

ctc atc ttg ttg ggc cag gcc gaa gca gca ttg gag aag ttg gtc gtc 2611
Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu Glu Lys Leu Val Val
745 750 755

ttg cac gct gcg agt gcg gct aac tgc cat ggc ctc cta tat ttt gcc 2659
Leu His Ala Ala Ser Ala Ala Asn Cys His Gly Leu Leu Tyr Phe Ala
760 765 770

atc ttc ttc gtg gca gct tgg cac atc agg ggt cgg gtg gtc ccc ttg 2707
Ile Phe Phe Val Ala Ala Trp His Ile Arg Gly Arg Val Val Pro Leu
775 780 785

acc acc tat tgc ctc act ggc cta tgg ccc ttc tgc cta ctg ctc atg 2755
Thr Thr Tyr Cys Leu Thr Gly Leu Trp Pro Phe Cys Leu Leu Leu Met
790 795 800 805

gca ctg ccc cgg cag gct tat gcc tat gac gca cct gtg cac gga cag 2803
Ala Leu Pro Arg Gln Ala Tyr Ala Tyr Asp Ala Pro Val His Gly Gln
810 815 820

ata ggc gtg ggt ttg ttg ata ttg atc acc ctc ttc aca ctc acc ccg 2851
Ile Gly Val Gly Leu Leu Ile Leu Ile Thr Leu Phe Thr Leu Thr Pro
825 830 835 






ggg tat aag acc ctc ctc ggc cag tgt ctg tgg tgg ttg tgc tat ctc 2899
Gly Tyr Lys Thr Leu Leu Gly Gln Cys Leu Trp Trp Leu Cys Tyr Leu
840 845 850

ctg acc ctg ggg gaa gcc atg att cag gag tgg gta cca ccc atg cag 2947
Leu Thr Leu Gly Glu Ala Met Ile Gln Glu Trp Val Pro Pro Met Gln
855 860 865

gtg cgc ggc ggc cgc gat ggc atc gcg tgg gcc gtc act ata ttc tgc 2995
Val Arg Gly Gly Arg Asp Gly Ile Ala Trp Ala Val Thr Ile Phe Cys
870 875 880 885

ccg ggt gtg gtg ttt gac att acc aaa tgg ctt ttg gcg ttg ctt ggg 3043
Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu Leu Ala Leu Leu Gly
890 895 900

cct gct tac ctc tta agg gcc gct ttg aca cat gtg ccg tac ttc gtc 3091
Pro Ala Tyr Leu Leu Arg Ala Ala Leu Thr His Val Pro Tyr Phe Val
905 910 915

aga gct cac gct ctg ata agg gta tgc gct ttg gtg aag cag ctc gcg 3139
Arg Ala His Ala Leu Ile Arg Val Cys Ala Leu Val Lys Gln Leu Ala
920 925 930

ggg ggt agg tat gtt cag gtg gcg cta ttg gcc ctt ggc agg tgg act 3187
Gly Gly Arg Tyr Val Gln Val Ala Leu Leu Ala Leu Gly Arg Trp Thr
935 940 945

ggc acc tac atc tat gac cac ctc aca cct atg tcg gac tgg gcc gct 3235
Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met Ser Asp Trp Ala Ala
950 955 960 965

agc ggc ctg cgc gac tta gcg gtc gcc gtg gaa ccc atc atc ttc agt 3283
Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Ile Ile Phe Ser
970 975 980 



ccg atg gag aag aag gtc atc gtc tgg gga gcg gag acg gct gca tgt 3331
Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala Glu Thr Ala Ala Cys
985 990 995

ggg gac att cta cat gga ctt ccc gtg tcc gcc cga ctc ggc cag gag 3379
Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala Arg Leu Gly Gln Glu
1000 1005 1010

atc ctc ctc ggc cca gct gat ggc tac acc tcc aag ggg tgg aag ctc 3427
Ile Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser Lys Gly Trp Lys Leu
1015 1020 1025

ctt gct ccc atc act gct tat gcc cag caa aca cga ggc ctc ctg ggc 3475
Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr Arg Gly Leu Leu Gly
1030 1035 1040 1045

gcc ata gtg gtg agt atg acg ggg cgt gac agg aca gaa cag gcc ggg 3523
Ala Ile Val Val Ser Met Thr Gly Arg Asp Arg Thr Glu Gln Ala Gly
1050 1055 1060

gaa gtc caa atc ctg tcc aca gtc tct cag tcc ttc ctc gga aca acc 3571
Glu Val Gln Ile Leu Ser Thr Val Ser Gln Ser Phe Leu Gly Thr Thr
1065 1070 1075

atc tcg ggg gtt ttg tgg act gtt tac cac gga gct ggc aac aag act 3619
Ile Ser Gly Val Leu Trp Thr Val Tyr His Gly Ala Gly Asn Lys Thr
1080 1085 1090

cta gcc ggc tta cgg ggt ccg gtc acg cag atg tac tcg agt gct gag 3667
Leu Ala Gly Leu Arg Gly Pro Val Thr Gln Met Tyr Ser Ser Ala Glu
1095 1100 1105 

ggg gac ttg gta ggc tgg ccc agc ccc cct ggg acc aag tct ttg gag 3715
Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu
1110 1115 1120 1125

ccg tgc aag tgt gga gcc gtc gac cta tat ctg gtc acg cgg aac gct 3763
Pro Cys Lys Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn Ala
1130 1135 1140

gat gtc atc ccg gct cgg aga cgc ggg gac aag cgg gga gca ttg ctc 3811
Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu
1145 1150 1155

tcc ccg aga ccc att tcg acc ttg aag ggg tcc tcg ggg ggg ccg gtg 3859
Ser Pro Arg Pro Ile Ser Thr Leu Lys Gly Ser Ser Gly Gly Pro Val
1160 1165 1170

ctc tgc cct agg ggc cac gtc gtt ggg ctc ttc cga gca gct gtg tgc 3907
Leu Cys Pro Arg Gly His Val Val Gly Leu Phe Arg Ala Ala Val Cys
1175 1180 1185

tct cgg ggc gtg gcc aaa tcc atc gat ttc atc ccc gtt gag aca ctc 3955
Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile Pro Val Glu Thr Leu
1190 1195 1200 1205

gac gtt gtt aca agg tct ccc act ttc agt gac aac agc acg cca ccg 4003
Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp Asn Ser Thr Pro Pro
1210 1215 1220

gct gtg ccc cag acc tat cag gtc ggg tac ttg cat gct cca act ggc 4051
Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu His Ala Pro Thr Gly
1225 1230 1235 



agt gga aag agc acc aag gtc cct gtc gcg tat gcc gcc cag ggg tac 4099
Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr Ala Ala Gln Gly Tyr
1240 1245 1250

aaa gta cta gtg ctt aac ccc tcg gta gct gcc acc ctg ggg ttt ggg 4147
Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly
1255 1260 1265

gcg tac cta tcc aag gca cat ggc atc aat ccc aac att agg act gga 4195
Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro Asn Ile Arg Thr Gly
1270 1275 1280 1285

gtc agg acc gtg atg acc ggg gag gcc atc acg tac tcc aca tat ggc 4243
Val Arg Thr Val Met Thr Gly Glu Ala Ile Thr Tyr Ser Thr Tyr Gly
1290 1295 1300

aaa ttt ctc gcc gat ggg ggc tgc gct agc ggc gcc tat gac atc atc 4291
Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly Ala Tyr Asp Ile Ile
1305 1310 1315

ata tgc gat gaa tgc cac gct gtg gat gct acc tcc att ctc ggc atc 4339
Ile Cys Asp Glu Cys His Ala Val Asp Ala Thr Ser Ile Leu Gly Ile
1320 1325 1330

gga acg gtc ctt gat caa gca gag aca gcc ggg gtc aga cta act gtg 4387
Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly Val Arg Leu Thr Val
1335 1340 1345

ctg gct acg gcc aca ccc ccc ggg tca gtg aca acc ccc cat ccc gat 4435
Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asp
1350 1355 1360 1365

ata gaa gag gta ggc ctc ggg cgg gag ggt gag atc ccc ttc tat ggg 4483
Ile Glu Glu Val Gly Leu Gly Arg Glu Gly Glu Ile Pro Phe Tyr Gly
1370 1375 1380

agg gcg att ccc cta tcc tgc atc aag gga ggg aga cac ctg att ttc 4531
Arg Ala Ile Pro Leu Ser Cys Ile Lys Gly Gly Arg His Leu Ile Phe
1385 1390 1395

tgc cac tca aag aaa aag tgt gac gag ctc gcg gcg gcc ctt cgg ggc 4579
Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Ala Leu Arg Gly
1400 1405 1410

atg ggc ttg aat gcc gtg gca tac tat aga ggg ttg gac gtc tcc ata 4627
Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Ile
1415 1420 1425

ata cca gct cag gga gat gtg gtg gtc gtc gcc acc gac gcc ctc atg 4675
Ile Pro Ala Gln Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met
1430 1435 1440 1445

acg ggg tac act gga gac ttt gac tcc gtg atc gac tgc aat gta gcg 4723
Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Val Ala
1450 1455 1460

gtc acc caa gct gtc gac ttc agc ctg gac ccc acc ttc act ata acc 4771
Val Thr Gln Ala Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Thr
1465 1470 1475

aca cag act gtc cca caa gac gct gtc tca cgc agt cag cgc cgc ggg 4819
Thr Gln Thr Val Pro Gln Asp Ala Val Ser Arg Ser Gln Arg Arg Gly
1480 1485 1490 

cgc aca ggt aga gga aga cag ggc act tat agg tat gtt tcc act ggt 4867
Arg Thr Gly Arg Gly Arg Gln Gly Thr Tyr Arg Tyr Val Ser Thr Gly
1495 1500 1505

gaa cga gcc tca gga atg ttt gac agt gta gtg ctt tgt gag tgc tac 4915
Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr
1510 1515 1520 1525

gac gca ggg gct gcg tgg tac gat ctc aca cca gcg gag acc acc gtc 4963
Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro Ala Glu Thr Thr Val
1530 1535 1540

agg ctt aga gcg tat ttc aac acg ccc ggc cta ccc gtg tgt caa gac 5011
Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gln Asp
1545 1550 1555

cat ctt gaa ttt tgg gag gca gtt ttc acc ggc ctc aca cac ata gac 5059
His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His Ile Asp
1560 1565 1570

gcc cac ttc ctc tcc caa aca aag caa gcg ggg gag aac ttc gcg tac 5107
Ala His Phe Leu Ser Gln Thr Lys Gln Ala Gly Glu Asn Phe Ala Tyr
1575 1580 1585

cta gta gcc tac caa gct acg gtg tgc gcc aga gcc aag gcc cct ccc 5155
Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro
1590 1595 1600 1605

ccg tcc tgg gac gcc atg tgg aag tgc ctg gcc cga ctc aag cct acg 5203
Pro Ser Trp Asp Ala Met Trp Lys Cys Leu Ala Arg Leu Lys Pro Thr
1610 1615 1620 

ctt gcg ggc ccc aca cct ctc ctg tac cgt ttg ggc cct att acc aat 5251
Leu Ala Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Ile Thr Asn
1625 1630 1635

gag gtc acc ctc aca cac cct ggg acg aag tac atc gcc aca tgc atg 5299
Glu Val Thr Leu Thr His Pro Gly Thr Lys Tyr Ile Ala Thr Cys Met
1640 1645 1650

caa gct gac ctt gag gtc atg acc agc acg tgg gtc cta gct gga gga 5347
Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly Gly
1655 1660 1665

gtc ctg gca gcc gtc gcc gca tat tgc ctg gcg act gga tgc gtt tcc 5395
Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys Val Ser
1670 1675 1680 1685

atc atc ggc cgc ttg cac gtc aac cag cga gtc gtc gtt gcg ccg gat 5443
Ile Ile Gly Arg Leu His Val Asn Gln Arg Val Val Val Ala Pro Asp
1690 1695 1700

aag gag gtc ctg tat gag gct ttt gat gag atg gag gaa tgc gcc tct 5491
Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Cys Ala Ser
1705 1710 1715

agg gcg gct ctc atc gaa gag ggg cag cgg ata gcc gag atg ttg aag 5539
Arg Ala Ala Leu Ile Glu Glu Gly Gln Arg Ile Ala Glu Met Leu Lys
1720 1725 1730

tcc aag atc caa ggc ttg ctg cag cag gcc tct aag cag gcc cag gac 5587
Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser Lys Gln Ala Gln Asp
1735 1740 1745 

ata caa ccc gct atg cag gct tca tgg ccc aaa gtg gaa caa ttt tgg 5635
Ile Gln Pro Ala Met Gln Ala Ser Trp Pro Lys Val Glu Gln Phe Trp
1750 1755 1760 1765

gcc aga cac atg tgg aac ttc att agc ggc atc caa tac ctc gca gga 5683
Ala Arg His Met Trp Asn Phe Ile Ser Gly Ile Gln Tyr Leu Ala Gly
1770 1775 1780

ttg tca aca ctg cca ggg aac ccc gcg gtg gct tcc atg atg gca ttc 5731
Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe
1785 1790 1795

agt gcc gcc ctc acc agt ccg ttg tcg acc agt acc acc atc ctt ctc 5779
Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr Ile Leu Leu
1800 1805 1810

aac atc atg gga ggc tgg tta gcg tcc cag atc gca cca ccc gcg ggg 5827
Asn Ile Met Gly Gly Trp Leu Ala Ser Gln Ile Ala Pro Pro Ala Gly
1815 1820 1825

gcc acc ggc ttt gtc gtc agt ggc ctg gtg ggg gct gcc gtg ggc agc 5875
Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser
1830 1835 1840 1845

ata ggc ctg ggt aag gtg ctg gtg gac atc ctg gca gga tat ggt gcg 5923
Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu Ala Gly Tyr Gly Ala
1850 1855 1860

ggc att tcg ggg gcc ctc gtc gca ttc aag atc atg tct ggc gag aag 5971
Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile Met Ser Gly Glu Lys
1865 1870 1875 



ccc tct atg gaa gat gtc atc aat cta ctg cct ggg atc ctg tct ccg 6019
Pro Ser Met Glu Asp Val Ile Asn Leu Leu Pro Gly Ile Leu Ser Pro
1880 1885 1890

gga gcc ctg gtg gtg ggg gtc atc tgc gcg gcc att ctg cgc cgc cac 6067
Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala Ile Leu Arg Arg His
1895 1900 1905

gtg gga ccg ggg gag ggc gcg gtc caa tgg atg aac agg ctt att gcc 6115
Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met Asn Arg Leu Ile Ala
1910 1915 1920 1925

ttt gct tcc aga gga aac cac gtc gcc cct act cac tac gtg acg gag 6163
Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr Glu
1930 1935 1940

tcg gat gcg tcg cag cgt gtg acc caa cta ctt ggc tct ctt act ata 6211
Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu Gly Ser Leu Thr Ile
1945 1950 1955

acc agc cta ctc aga aga ctc cac aat tgg ata act gag gac tgc ccc 6259
Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile Thr Glu Asp Cys Pro
1960 1965 1970

atc cca tgc tcc gga tcc tgg ctc cgc gac gtg tgg gac tgg gtt tgc 6307
Ile Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Val Cys
1975 1980 1985

acc atc ttg aca gac ttc aaa aat tgg ctg acc tct aaa ttg ttc ccc 6355
Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro
1990 1995 2000 2005 



aag ctg ccc ggc ctc ccc ttc atc tct tgt caa aag ggg tac aag ggt 6403
Lys Leu Pro Gly Leu Pro Phe Ile Ser Cys Gln Lys Gly Tyr Lys Gly
2010 2015 2020

gtg tgg gcc ggc act ggc atc atg acc acg cgc tgc cct tgc ggc gcc 6451
Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg Cys Pro Cys Gly Ala
2025 2030 2035

aac atc tct ggc aat gtc cgc ctg ggc tct atg agg atc aca ggg cct 6499
Asn Ile Ser Gly Asn Val Arg Leu Gly Ser Met Arg Ile Thr Gly Pro
2040 2045 2050

aaa acc tgc atg aac acc tgg cag ggg acc ttt cct atc aat tgc tac 6547
Lys Thr Cys Met Asn Thr Trp Gln Gly Thr Phe Pro Ile Asn Cys Tyr
2055 2060 2065

acg gag ggc cag tgc gcg ccg aaa ccc ccc acg aac tac aag acc gcc 6595
Thr Glu Gly Gln Cys Ala Pro Lys Pro Pro Thr Asn Tyr Lys Thr Ala
2070 2075 2080 2085

atc tgg agg gtg gcg gcc tcg gag tac gcg gag gtg acg cag cat ggg 6643
Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gln His Gly
2090 2095 2100

tcg tac tcc tat gta aca gga ctg acc act gac aat ctg aaa att cct 6691
Ser Tyr Ser Tyr Val Thr Gly Leu Thr Thr Asp Asn Leu Lys Ile Pro
2105 2110 2115

tgc caa cta cct tct cca gag ttt ttc tcc tgg gtg gac ggt gtg cag 6739
Cys Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gln
2120 2125 2130 

atc cat agg ttt gca ccc aca cca aag ccg ttt ttc cgg gat gag gtc 6787
Ile His Arg Phe Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu Val
2135 2140 2145

tcg ttc tgc gtt ggg ctt aat tcc tat gct gtc ggg tcc cag ctt ccc 6835
Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val Gly Ser Gln Leu Pro
2150 2155 2160 2165

tgt gaa cct gag ccc gac gca gac gta ttg agg tcc atg cta aca gat 6883
Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg Ser Met Leu Thr Asp
2170 2175 2180

ccg ccc cac atc acg gcg gag act gcg gcg cgg cgc ttg gca cgg gga 6931
Pro Pro His Ile Thr Ala Glu Thr Ala Ala Arg Arg Leu Ala Arg Gly
2185 2190 2195

tca cct cca tct gag gcg agc tcc tca gtg agc cag cta tca gca ccg 6979
Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser Gln Leu Ser Ala Pro
2200 2205 2210

tcg ctg cgg gcc acc tgc acc acc cac agc aac acc tat gac gtg gac 7027
Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn Thr Tyr Asp Val Asp
2215 2220 2225

atg gtc gat gcc aac ctg ctc atg gag ggc ggt gtg gct cag aca gag 7075
Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly Val Ala Gln Thr Glu
2230 2235 2240 2245

cct gag tcc agg gtg ccc gtt ctg gac ttt ctc gag cca atg gcc gag 7123
Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu Glu Pro Met Ala Glu 
2250 2255 2260 



gaa gag agc gac ctt gag ccc tca ata cca tcg gag tgc atg ctc ccc 7171
Glu Glu Ser Asp Leu Glu Pro Ser Ile Pro Ser Glu Cys Met Leu Pro
2265 2270 2275

agg agc ggg ttt cca cgg gcc tta ccg gct tgg gca cgg cct gac tac 7219
Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr
2280 2285 2290

aac ccg ccg ctc gtg gaa tcg tgg agg agg cca gat tac caa ccg ccc 7267
Asn Pro Pro Leu Val Glu Ser Trp Arg Arg Pro Asp Tyr Gln Pro Pro
2295 2300 2305

acc gtt gct ggt tgt gct ctc ccc ccc ccc aag aag gcc ccg acg cct 7315
Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lys Ala Pro Thr Pro
2310 2315 2320 2325

ccc cca agg aga cgc cgg aca gtg ggt ctg agc gag agc acc ata tca 7363
Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Thr Ile Ser
2330 2335 2340

gaa gcc ctc cag caa ctg gcc atc aag acc ttt ggc cag ccc ccc tcg 7411
Glu Ala Leu Gln Gln Leu Ala Ile Lys Thr Phe Gly Gln Pro Pro Ser
2345 2350 2355

agc ggt gat gca ggc tcg tcc acg ggg gcg ggc gcc gcc gaa tcc ggc 7459
Ser Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly Ala Ala Glu Ser Gly
2360 2365 2370

ggt ccg acg tcc cct ggt gag ccg gcc ccc tca gag aca ggt tcc gcc 7507
Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser Glu Thr Gly Ser Ala 
2375 2380 2385 

tcc tct atg ccc ccc ctc gag ggg gag cct gga gat ccg gac ctg gag 7555
Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Glu
2390 2395 2400 2405

tct gat cag gta gag ctt caa cct ccc ccc cag ggg ggg ggg gta gct 7603
Ser Asp Gln Val Glu Leu Gln Pro Pro Pro Gln Gly Gly Gly Val Ala
2410 2415 2420

ccc ggt tcg ggc tcg ggg tct tgg tct act tgc tcc gag gag gac gat 7651
Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys Ser Glu Glu Asp Asp
2425 2430 2435

acc acc gtg tgc tgc tcc atg tca tac tcc tgg acc ggg gct cta ata 7699
Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu Ile
2440 2445 2450

act ccc tgt agc ccc gaa gag gaa aag ttg cca atc aac cct ttg agt 7747
Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro Ile Asn Pro Leu Ser
2455 2460 2465

aac tcg ctg ttg cga tac cat aac aag gtg tac tgt aca aca tca aag 7795
Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr Cys Thr Thr Ser Lys
2470 2475 2480 2485

agc gcc tca cag agg gct aaa aag gta act ttt gac agg acg caa gtg 7843
Ser Ala Ser Gln Arg Ala Lys Lys Val Thr Phe Asp Arg Thr Gln Val
2490 2495 2500

ctc gac gcc cat tat gac tca gtc tta aag gac atc aag cta gcg gct 7891
Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp Ile Lys Leu Ala Ala
2505 2510 2515 

tcc aag gtc agc gca agg ctc ctc acc ttg gag gag gcg tgc cag ttg 7939
Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gln Leu
2520 2525 2530

act cca ccc cat tct gca aga tcc aag tat gga ttc ggg gcc aag gag 7987
Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu
2535 2540 2545

gtc cgc agc ttg tcc ggg agg gcc gtt aac cac atc aag tcc gtg tgg 8035
Val Arg Ser Leu Ser Gly Arg Ala Val Asn His Ile Lys Ser Val Trp
2550 2555 2560 2565

aag gac ctc ctg gaa gac cca caa aca cca att ccc aca acc atc atg 8083
Lys Asp Leu Leu Glu Asp Pro Gln Thr Pro Ile Pro Thr Thr Ile Met
2570 2575 2580

gcc aaa aat gag gtg ttc tgc gtg gac ccc gcc aag ggg ggt aag aaa 8131
Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala Lys Gly Gly Lys Lys
2585 2590 2595

cca gct cgc ctc atc gtt tac cct gac ctc ggc gtc cgg gtc tgc gag 8179
Pro Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu
2600 2605 2610

aaa atg gcc ctc tat gac att aca caa aag ctt cct cag gcg gta atg 8227
Lys Met Ala Leu Tyr Asp Ile Thr Gln Lys Leu Pro Gln Ala Val Met
2615 2620 2625

gga gct tcc tat ggc ttc cag tac tcc cct gcc caa cgg gtg gag tat 8275
Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala Gln Arg Val Glu Tyr
2630 2635 2640 2645 

ctc ttg aaa gca tgg gcg gaa aag aag gac ccc atg ggt ttt tcg tat 8323

Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro Met Gly Phe Ser Tyr
2650 2655 2660

gat acc cga tgc ttc gac tca acc gtc act gag aga gac atc agg acc 8371

Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg Asp Ile Arg Thr

2665 2670 2675

gag gag tcc ata tac cag gcc tgc tcc ctg ccc gag gag gcc cgc act 8419

Glu Glu Ser Ile Tyr Gln Ala Cys Ser Leu Pro Glu Glu Ala Arg Thr

2680 2685 2690

gcc ata cac tcg ctg act gag aga ctt tac gta gga ggg ccc atg ttc 8467

Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Phe
2695 2700 2705

aac agc aag ggt caa acc tgc ggt tac aga cgt tgc cgc gcc agc ggg 8515

Asn Ser Lys Gly Gln Thr Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly
2710 2715 2720 2725

gtg cta acc act agc atg ggt aac acc atc aca tgc tat gtg aaa gcc 8563

Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr Cys Tyr Val Lys Ala
2730 2735 2740

cta gcg gcc tgc aag gct gcg ggg ata gtt gcg ccc aca atg ctg gta 8611

Leu Ala Ala Cys Lys Ala Ala Gly Ile Val Ala Pro Thr Met Leu Val

2745 2750 2755

tgc ggc gat gac cta gta gtc atc tca gaa agc cag ggg act gag gag 8659

Cys Gly Asp Asp Leu Val Val Ile Ser Glu Ser Gln Gly Thr Glu Glu
2760 2765 2770

gac gag cgg aac ctg aga gcc ttc acg gag gcc atg acc agg tac tct 8707

Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser
2775 2780 2785

gcc cct cct ggt gat ccc ccc aga ccg gaa tat gac ctg gag cta ata 8755

Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu Ile
2790 2795 2800 2805

aca tcc tgt tcc tca aat gtg tct gtg gcg ttg ggc ccg cgg ggc cgc 8803

Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu Gly Pro Arg Gly Arg
2810 2815 2820

cgc aga tac tac ctg acc aga gac cca acc act cca ctc gcc cgg gct 8851

Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala
2825 2830 2835

gcc tgg gaa aca gtt aga cac tcc cct atc aat tca tgg ctg gga aac 8899

Ala Trp Glu Thr Val Arg His Ser Pro Ile Asn Ser Trp Leu Gly Asn
2840 2845 2850

atc atc cag tat gct cca acc ata tgg gtt cgc atg gtc cta atg aca 8947

Ile Ile Gln Tyr Ala Pro Thr Ile Trp Val Arg Met Val Leu Met Thr
2855 2860 2865

cac ttc ttc tcc att ctc atg gtc caa gac acc ctg gac cag aac ctc 8995

His Phe Phe Ser Ile Leu Met Val Gln Asp Thr Leu Asp Gln Asn Leu
2870 2875 2880 2885

aac ttt gag atg tat gga tca gta tac tcc gtg aat cct ttg gac ctt 9043
Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val Asn Pro Leu Asp Leu 
2890 2895 2900 

cca gcc ata att gag agg tta cac ggg ctt gac gcc ttt tct atg cac 9091
Pro Ala Ile Ile Glu Arg Leu His Gly Leu Asp Ala Phe Ser Met His
2905 2910 2915

aca tac tct cac cac gaa ctg acg cgg gtg gct tca gcc ctc aga aaa 9139
Thr Tyr Ser His His Glu Leu Thr Arg Val Ala Ser Ala Leu Arg Lys
2920 2925 2930

ctt ggg gcg cca ccc ctc agg gtg tgg aag agt cgg gct cgc gca gtc 9187
Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser Arg Ala Arg Ala Val
2935 2940 2945

agg gcg tcc ctc atc tcc cgt gga ggg aaa gcg gcc gtt tgc ggc cga 9235
Arg Ala Ser Leu Ile Ser Arg Gly Gly Lys Ala Ala Val Cys Gly Arg
2950 2955 2960 2965

tat ctc ttc aat tgg gcg gtg aag acc aag ctc aaa ctc act cca ttg 9283
Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu
2970 2975 2980

ccg gag gcg cgc cta ctg gac tta tcc agt tgg ttc acc gtc ggc gcc 9331
Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp Phe Thr Val Gly Ala
2985 2990 2995

ggc ggg ggc gac att ttc cac agc gtg tcg cgc gcc cga ccc cgc tca 9379
Gly Gly Gly Asp Ile Phe His Ser Val Ser Arg Ala Arg Pro Arg Ser
3000 3005 3010

tta ctc ttc ggc cta ctc cta ctt ttc gta ggg gta ggc ctc ttc cta 9427
Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly Val Gly Leu Phe Leu
3015 3020 3025

ctc ccc gct cgg tag agcggcacac actaggtaca ctccatagct aactgttcct 9482

Leu Pro Ala Arg 

3030 

tttttttttt tttttttttt tttttttttt tttttttttt ttcttttttt tttttttccc 9542 
tctttcttcc cttctcatct tattctactt tctttcttgg tggctccatc ttagccctag 9602 
tcacggctag ctgtgaaagg tccgtgagcc gcatgactgc agagagtgcc gtaactggtc 9662
tctctgcaga tcatgt 9678



<210> 4 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus 
<400> 4 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn

1 5 10 15

Arg Arg Pro Glu Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly

20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr

35 40 45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly

65 70 75 80

Arg Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp

85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro

100 105 110

Arg His Arg Ser Arg Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys

115 120 125

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu

130 135 140 

Ser Gly Ala Ala Arg Ala Val Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Phe Pro Phe Ser Ile

165 170 175

Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr Val Pro Val Ser Ala Ala

180 185 190

Gln Val Lys Asn Thr Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Ser

195 200 205

Asn Asp Ser Ile Thr Trp Gln Leu Glu Ala Ala Val Leu His Val Pro

210 215 220

Gly Cys Val Pro Cys Glu Arg Val Gly Asn Thr Ser Arg Cys Trp Val
225 230 235 240

Pro Val Ser Pro Asn Met Ala Val Arg Gln Pro Gly Ala Leu Thr Gln

245 250 255

Gly Leu Arg Thr His Ile Asp Met Val Val Met Ser Ala Thr Phe Cys

260 265 270

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala

275 280 285

Gln Val Phe Ile Val Ser Pro Gln Tyr His Trp Phe Val Gln Glu Cys

290 295 300

Asn Cys Ser Ile Tyr Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp
305 310 315 320

Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr

325 330 335

Val Met Arg Val Pro Glu Val Ile Ile Asp Ile Val Ser Gly Ala His

340 345 350 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp

355 360 365

Ala Lys Val Ile Val Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Gly

370 375 380

Thr Thr Thr Val Gly Gly Ala Val Ala Arg Ser Thr Asn Val Ile Ala
385 390 395 400

Gly Val Phe Ser His Gly Pro Gln Gln Asn Ile Gln Leu Ile Asn Thr

405 410 415

Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser

420 425 430

Leu Asn Thr Gly Phe Leu Ala Ala Leu Phe Tyr Thr Asn Arg Phe Asn

435 440 445

Ser Ser Gly Cys Pro Gly Arg Leu Ser Ala Cys Arg Asn Ile Glu Ala

450 455 460

Phe Arg Ile Gly Trp Gly Thr Leu Gln Tyr Glu Asp Asn Val Thr Asn
465 470 475 480 

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys 

485 490 495 

Gly Val Val Pro Ala Arg Ser Val Cys Gly Pro Val Tyr Cys Phe Thr 

500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Arg Gly Val Pro Thr 

515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 

530 535 540 

Arg Pro Pro Gln Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr
545 550 555 560

Gly Phe Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp
565 570 575

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys

580 585 590

His Pro Asp Ala Thr Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr

595 600 605

Pro Lys Cys Leu Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys

610 615 620 

Thr Val Asn Phe Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val
625 630 635 640

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys

645 650 655

Asp Leu Glu Asp Arg Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser

660 665 670

Thr Thr Glu Trp Ala Ile Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala

675 680 685

Leu Ser Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln

690 695 700

Tyr Met Tyr Gly Leu Ser Pro Ala Ile Thr Lys Tyr Val Val Arg Trp
705 710 715 720

Glu Trp Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys

725 730 735

Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu

740 745 750

Glu Lys Leu Val Val Leu His Ala Ala Ser Ala Ala Asn Cys His Gly

755 760 765

Leu Leu Tyr Phe Ala Ile Phe Phe Val Ala Ala Trp His Ile Arg Gly

770 775 780

Arg Val Val Pro Leu Thr Thr Tyr Cys Leu Thr Gly Leu Trp Pro Phe

785 790 795 800

Cys Leu Leu Leu Met Ala Leu Pro Arg Gln Ala Tyr Ala Tyr Asp Ala

805 810 815

Pro Val His Gly Gln Ile Gly Val Gly Leu Leu Ile Leu Ile Thr Leu
820 825 830

Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Gly Gln Cys Leu Trp

835 840 845

Trp Leu Cys Tyr Leu Leu Thr Leu Gly Glu Ala Met Ile Gln Glu Trp

850 855 860

Val Pro Pro Met Gln Val Arg Gly Gly Arg Asp Gly Ile Ala Trp Ala
865 870 875 880

Val Thr Ile Phe Cys Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu

885 890 895

Leu Ala Leu Leu Gly Pro Ala Tyr Leu Leu Arg Ala Ala Leu Thr His

900 905 910

Val Pro Tyr Phe Val Arg Ala His Ala Leu Ile Arg Val Cys Ala Leu

915 920 925

Val Lys Gln Leu Ala Gly Gly Arg Tyr Val Gln Val Ala Leu Leu Ala

930 935 940 

Leu Gly Arg Trp Thr Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met
945 950 955 960

Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu

965 970 975

Pro Ile Ile Phe Ser Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala

980 985 990

Glu Thr Ala Ala Cys Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala

995 1000 1005

Arg Leu Gly Gln Glu Ile Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser

1010 1015 1020

Lys Gly Trp Lys Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr
1025 1030 1035 1040

Arg Gly Leu Leu Gly Ala Ile Val Val Ser Met Thr Gly Arg Asp Arg

1045 1050 1055

Thr Glu Gln Ala Gly Glu Val Gln Ile Leu Ser Thr Val Ser Gln Ser

1060 1065 1070

Phe Leu Gly Thr Thr Ile Ser Gly Val Leu Trp Thr Val Tyr His Gly
1075 1080 1085

Ala Gly Asn Lys Thr Leu Ala Gly Leu Arg Gly Pro Val Thr Gln Met

1090 1095 1100

Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly
1105 1110 1115 1120

Thr Lys Ser Leu Glu Pro Cys Lys Cys Gly Ala Val Asp Leu Tyr Leu

1125 1130 1135

Val Thr Arg Asn Ala Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys

1140 1145 1150

Arg Gly Ala Leu Leu Ser Pro Arg Pro Ile Ser Thr Leu Lys Gly Ser

1155 1160 1165

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Val Val Gly Leu Phe

1170 1175 1180

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile
1185 1190 1195 1200

Pro Val Glu Thr Leu Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp

1205 1210 1215

Asn Ser Thr Pro Pro Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu

1220 1225 1230

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr

1235 1240 1245

Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala

1250 1255 1260

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro
1265 1270 1275 1280

Asn Ile Arg Thr Gly Val Arg Thr Val Met Thr Gly Glu Ala Ile Thr

1285 1290 1295

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly

1300 1305 1310

Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ala Val Asp Ala Thr

1315 1320 1325

Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly

1330 1335 1340

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr

1345 1350 1355 1360 

Thr Pro His Pro Asp Ile Glu Glu Val Gly Leu Gly Arg Glu Gly Glu

1365 1370 1375

Ile Pro Phe Tyr Gly Arg Ala Ile Pro Leu Ser Cys Ile Lys Gly Gly

1380 1385 1390

Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala

1395 1400 1405

Ala Ala Leu Arg Gly Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly

1410 1415 1420

Leu Asp Val Ser Ile Ile Pro Ala Gln Gly Asp Val Val Val Val Ala
1425 1430 1435 1440

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile

1445 1450 1455

Asp Cys Asn Val Ala Val Thr Gln Ala Val Asp Phe Ser Leu Asp Pro

1460 1465 1470

Thr Phe Thr Ile Thr Thr Gln Thr Val Pro Gln Asp Ala Val Ser Arg

1475 1480 1485

Ser Gln Arg Arg Gly Arg Thr Gly Arg Gly Arg Gln Gly Thr Tyr Arg

1490 1495 1500 

Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro 

1525 1530 1535 

Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 

1540 1545 1550 

Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly

1555 1560 1565

Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln Ala Gly

1570 1575 1580

Glu Asn Phe Ala Tyr Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg
1585 1590 1595 1600

Ala Lys Ala Pro Pro Pro Ser Trp Asp Ala Met Trp Lys Cys Leu Ala

1605 1610 1615

Arg Leu Lys Pro Thr Leu Ala Gly Pro Thr Pro Leu Leu Tyr Arg Leu

1620 1625 1630

Gly Pro Ile Thr Asn Glu Val Thr Leu Thr His Pro Gly Thr Lys Tyr

1635 1640 1645

Ile Ala Thr Cys Met Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp

1650 1655 1660

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys Val Ser Ile Ile Gly Arg Leu His Val Asn Gln Arg Val

1685 1690 1695

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met

1700 1705 1710

Glu Glu Cys Ala Ser Arg Ala Ala Leu Ile Glu Glu Gly Gln Arg Ile

1715 1720 1725

Ala Glu Met Leu Lys Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser

1730 1735 1740

Lys Gln Ala Gln Asp Ile Gln Pro Ala Met Gln Ala Ser Trp Pro Lys
1745 1750 1755 1760

Val Glu Gln Phe Trp Ala Arg His Met Trp Asn Phe Ile Ser Gly Ile

1765 1770 1775

Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala

1780 1785 1790

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser

1795 1800 1805

Thr Thr Ile Leu Leu Asn Ile Met Gly Gly Trp Leu Ala Ser Gln Ile

1810 1815 1820

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly
1825 1830 1835 1840

Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu
1845 1850 1855

Ala Gly Tyr Gly Ala Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile

1860 1865 1870

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val Ile Asn Leu Leu Pro

1875 1880 1885 

Gly Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala

1890 1895 1900

Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met
1905 1910 1915 1920

Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr

1925 1930 1935

His Tyr Val Thr Glu Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu

1940 1945 1950

Gly Ser Leu Thr Ile Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile

1955 1960 1965

Thr Glu Asp Cys Pro Ile Pro Cys Ser Gly Ser Trp Leu Arg Asp Val

1970 1975 1980

Trp Asp Trp Val Cys Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr
1985 1990 1995 2000

Ser Lys Leu Phe Pro Lys Leu Pro Gly Leu Pro Phe Ile Ser Cys Gln

2005 2010 2015

Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg

2020 2025 2030

Cys Pro Cys Gly Ala Asn Ile Ser Gly Asn Val Arg Leu Gly Ser Met

2035 2040 2045

Arg Ile Thr Gly Pro Lys Thr Cys Met Asn Thr Trp Gln Gly Thr Phe

2050 2055 2060

Pro Ile Asn Cys Tyr Thr Glu Gly Gln Cys Ala Pro Lys Pro Pro Thr
2065 2070 2075 2080

Asn Tyr Lys Thr Ala Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu

2085 2090 2095

Val Thr Gln His Gly Ser Tyr Ser Tyr Val Thr Gly Leu Thr Thr Asp
2100 2105 2110

Asn Leu Lys Ile Pro Cys Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp

2115 2120 2125

Val Asp Gly Val Gln Ile His Arg Phe Ala Pro Thr Pro Lys Pro Phe

2130 2135 2140 

Phe Arg Asp Glu Val Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val
2145 2150 2155 2160

Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg

2165 2170 2175

Ser Met Leu Thr Asp Pro Pro His Ile Thr Ala Glu Thr Ala Ala Arg

2180 2185 2190

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser

2195 2200 2205

Gln Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn
2210 2215 2220

Thr Tyr Asp Val Asp Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly

2225 2230 2235 2240

Val Ala Gln Thr Glu Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu

2245 2250 2255

Glu Pro Met Ala Glu Glu Glu Ser Asp Leu Glu Pro Ser Ile Pro Ser

2260 2265 2270

Glu Cys Met Leu Pro Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala Trp

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Arg Arg Pro 

2290 2295 2300 

Asp Tyr Gln Pro Pro Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys
2305 2310 2315 2320

Lys Ala Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser

2325 2330 2335

Glu Ser Thr Ile Ser Glu Ala Leu Gln Gln Leu Ala Ile Lys Thr Phe

2340 2345 2350

Gly Gln Pro Pro Ser Ser Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly
2355 2360 2365

Ala Ala Glu Ser Gly Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser

2370 2375 2380 

Glu Thr Gly Ser Ala Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Glu Ser Asp Gln Val Glu Leu Gln Pro Pro Pro Gln

2405 2410 2415 

Gly Gly Gly Val Ala Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys 

2420 2425 2430 

Ser Glu Glu Asp Asp Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp 

2435 2440 2445 

Thr Gly Ala Leu Ile Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro

2450 2455 2460

Ile Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr
2465 2470 2475 2480

Cys Thr Thr Ser Lys Ser Ala Ser Gln Arg Ala Lys Lys Val Thr Phe

2485 2490 2495

Asp Arg Thr Gln Val Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp

2500 2505 2510

Ile Lys Leu Ala Ala Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu

2515 2520 2525

Glu Ala Cys Gln Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly

2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 2550 2555 2560 

Ile Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Pro Gln Thr Pro Ile

2565 2570 2575

Pro Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala

2580 2585 2590

Lys Gly Gly Lys Lys Pro Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly

2595 2600 2605

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Ile Thr Gln Lys Leu
2610 2615 2620

Pro Gln Ala Val Met Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala

2625 2630 2635 2640

Gln Arg Val Glu Tyr Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro

2645 2650 2655

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu

2660 2665 2670

Arg Asp Ile Arg Thr Glu Glu Ser Ile Tyr Gln Ala Cys Ser Leu Pro

2675 2680 2685

Glu Glu Ala Arg Thr Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val

2690 2695 2700

Gly Gly Pro Met Phe Asn Ser Lys Gly Gln Thr Cys Gly Tyr Arg Arg

2705 2710 2715 2720

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr

2725 2730 2735

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly Ile Val Ala

2740 2745 2750

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Ser Glu Ser

2755 2760 2765

Gln Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala

2770 2775 2780 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 

2785 2790 2795 2800 

Asp Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu

2805 2810 2815 

Gly Pro Arg Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 

2820 2825 2830 

Pro Leu Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Ile Asn

2835 2840 2845

Ser Trp Leu Gly Asn Ile Ile Gln Tyr Ala Pro Thr Ile Trp Val Arg

2850 2855 2860

Met Val Leu Met Thr His Phe Phe Ser Ile Leu Met Val Gln Asp Thr

2865 2870 2875 2880

Leu Asp Gln Asn Leu Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val

2885 2890 2895

Asn Pro Leu Asp Leu Pro Ala Ile Ile Glu Arg Leu His Gly Leu Asp

2900 2905 2910 

Ala Phe Ser Met His Thr Tyr Ser His His Glu Leu Thr Arg Val Ala 

2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser

2930 2935 2940

Arg Ala Arg Ala Val Arg Ala Ser Leu Ile Ser Arg Gly Gly Lys Ala
2945 2950 2955 2960

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu

2965 2970 2975

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp

2980 2985 2990

Phe Thr Val Gly Ala Gly Gly Gly Asp Ile Phe His Ser Val Ser Arg

2995 3000 3005

Ala Arg Pro Arg Ser Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly 

3010 3015 3020 

Val Gly Leu Phe Leu Leu Pro Ala Arg 
3025 3030 



<210> 5 
<211> 9674 
<212> DNA 

<213> Hepatitis C virus 

<220> 

<221> CDS 

<222> (341)..(9442)

<400> 5

acccgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60

cttcacgcag aaagcgtcta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120

ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180

aagactgggt cctttcttgg ataaacccac tctatgcccg gcaatttggg cgtgcccccg 240

caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300

tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atg agc aca aat ccc 355
Met Ser Thr Asn Pro 



aaa cct caa aga aaa acc aaa aga aac act aac cgt cgc cca caa gac 403
Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Gln Asp

gtt aag ttt ccg ggc ggc ggc cag atc gtt ggc gga gta tac ttg ttg 451
Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr Leu Leu

ccg cgc agg ggc ccc agg ttg ggt gtg cgc gcg aca agg aag gct tcg 499
Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys Ala Ser

gag cgg tcc cag cca cgt ggg agg cgc cag ccc atc ccc aaa cat cgg 547
Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys His Arg

cgc tcc act ggc aag tcc tgg ggg aag cca gga tac ccc tgg ccc ctg 595
Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly Tyr Pro Trp Pro Leu

tat ggg aat gag ggg ctc ggt tgg gca gga tgg ctc ctg tcc cct cga 643

Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg
90 95 100

ggt tcc cgt ccc tca tgg ggc ccc aat gac ccc cgg cat agg tcg cgc 691

Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro Arg His Arg Ser Arg
105 110 115 

aat gtg ggt aag gtc atc gat acc cta acg tgc ggc ttt gcc gac ctc 739

Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
120 125 130

ttg ggg tac gtc ccc gtc gta ggc gcc ccg ctt agt ggc gtt gcc agt 787

Leu Gly Tyr Val Pro Val Val Gly Ala Pro Leu Ser Gly Val Ala Ser
135 140 145

gct ctc gcg cac ggc gtg aga gtc ctg gag gac ggg gtt aat ttt gca 835

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Phe Ala

150 155 160 165

aca ggg aac tta cct ggt tgc tcc ttt tct atc ttc ttg ctg gcc cta 883

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu
170 175 180

ctg tcc tgc atc act act ccg gtc tct gct gtc caa gtg aag aac acc 931

Leu Ser Cys Ile Thr Thr Pro Val Ser Ala Val Gln Val Lys Asn Thr
185 190 195

agc aac gcc tat atg gcg act aac gac tgt tcc aat gac agc atc act 979

Ser Asn Ala Tyr Met Ala Thr Asn Asp Cys Ser Asn Asp Ser Ile Thr

tgg cag ctt gag gcc gca gtc ctc cat gtc ccc ggg tgc gtc ccg tgc 1027
Trp Gln Leu Glu Ala Ala Val Leu His Val Pro Gly Cys Val Pro Cys
215 220 225 

gag aaa atg ggg aac aca tca cgg tgc tgg ata cca gtc tca cca aac 1075
Glu Lys Met Gly Asn Thr Ser Arg Cys Trp Ile Pro Val Ser Pro Asn
230 235 240 245

gtg gct gtg cgg cag cct ggc gcc ctc acg cgg ggc ttg cgg acg cac 1123
Val Ala Val Arg Gln Pro Gly Ala Leu Thr Arg Gly Leu Arg Thr His
250 255 260

atc gac atg gtc gtg ttg tcc gcc acg ctc tgc tcc gct ctc tac gtg 1171
Ile Asp Met Val Val Leu Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val
265 270 275

ggg gac ctc tgt ggc ggg gtg atg ctc gcg tcc cag atg ttc att gtc 1219
Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ser Gln Met Phe Ile Val
280 285 290

tcg ccg cag cac cac tgg ttc gtg cag gaa tgc aat tgc tcc atc tac 1267
Ser Pro Gln His His Trp Phe Val Gln Glu Cys Asn Cys Ser Ile Tyr
295 300 305

cct ggc gcc atc act ggg cac cgt atg gca tgg gac atg atg atg aac 1315
Pro Gly Ala Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
310 315 320 325

tgg tcg ccc acg acc acc atg atc ctg gcg tac gtg atg cgc gtt ccc 1363
Trp Ser Pro Thr Thr Thr Met Ile Leu Ala Tyr Val Met Arg Val Pro

gag gtc atc ata gac atc att agc gga gct cac tgg ggc gtc atg ttt 1411
Glu Val Ile Ile Asp Ile Ile Ser Gly Ala His Trp Gly Val Met Phe
345 350 355 

ggc ctg gcc tac ttc tct atg cag gga gcg tgg gcg aag gtc gtt gtc 1459
Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp Ala Lys Val Val Val
360 365 370

atc ctc ctg ctg gcc tct ggg gtg gac gcg tac acc acc acg act ggg 1507
Ile Leu Leu Leu Ala Ser Gly Val Asp Ala Tyr Thr Thr Thr Thr Gly
375 380 385

agc gct gct ggg cgc act acc agt agc ctg gcc agc gcc ttc tcc cct 1555
Ser Ala Ala Gly Arg Thr Thr Ser Ser Leu Ala Ser Ala Phe Ser Pro
390 395 400 405

ggc gct cgg cag aac att cag ctc att aat acc aat ggt agc tgg cac 1603
Gly Ala Arg Gln Asn Ile Gln Leu Ile Asn Thr Asn Gly Ser Trp His
410 415 420

atc aac cgc acc gcc ctg aat tgc aac gat tcc ttg cac acc ggc ttc 1651
Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu His Thr Gly Phe
425 430 435

ttc acg gcc ctg ttc tac atc cat aag ttc aac tcg tcg gga tgt ccc 1699
Phe Thr Ala Leu Phe Tyr Ile His Lys Phe Asn Ser Ser Gly Cys Pro
440 445 450

gag cgc ctg tcc gcc tgt cgc aac atc gag gac ttc cgg ata gga tgg 1747
Glu Arg Leu Ser Ala Cys Arg Asn Ile Glu Asp Phe Arg Ile Gly Trp

ggc gcc ctg caa tac gac gac aat gtc acc aat cca gaa gat atg agg 1795

Gly Ala Leu Gln Tyr Asp Asp Asn Val Thr Asn Pro Glu Asp Met Arg
470 475 480 485 

cca tat tgc tgg cac tac cca cca aaa cag tgt ggc gta gtc ccc gca 1843

Pro Tyr Cys Trp His Tyr Pro Pro Lys Gln Cys Gly Val Val Pro Ala

490 495 500

ggg acc gtg tgc ggc cca gtg tac tgt ttc acc cct agc ccg gtg gta 1891

Gly Thr Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val

505 510 515

gtg ggc acg acc gat aga ctt gga gtg cct act tac acg tgg gga gag 1939

Val Gly Thr Thr Asp Arg Leu Gly Val Pro Thr Tyr Thr Trp Gly Glu

520 525 530

aat gag aca gat gtc ttc cta ttg aac agc acc cga cca ccg tcg ggg 1987

Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr Arg Pro Pro Ser Gly
535 540 545

tca tgg ttt ggc tgc acg tgg atg aac tcc act ggc ttc acc aag acc 2035

Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr
550 555 560 565

tgc ggc gca cca ccc tgc cgc act aga gct gac ttc aat acc agc aca 2083

Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp Phe Asn Thr Ser Thr

570 575 580

gat ctg ttg tgc ccc acg gac tgt ttt aga aaa cat cct gaa gcc act 2131

Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr

tac atc aaa tgt ggt tcc ggg cct tgg ctc acg cca aag tgt ctg gtt 2179

Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Lys Cys Leu Val
600 605 610 

gac tac ccc tac agg ctc tgg cat tac cct tgc aca gtc aat tac tcc 2227

Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Tyr Ser
615 620 625

acc ttc aag atc agg atg tat gtg ggg gga gtt gag cac agg ctc atg 2275

Thr Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Met
630 635 640 645

gcc gcg tgc aat ttc act cgt ggg gat cgc tgc aac ttg gag gat agg 2323

Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys Asn Leu Glu Asp Arg
650 655 660

gac aga agt caa cag act cct ctg ttg cac tcc acc acg gaa tgg gcc 2371

Asp Arg Ser Gln Gln Thr Pro Leu Leu His Ser Thr Thr Glu Trp Ala
665 670 675

att ttg ccc tgc tct ttc tca gac ttg ccc gct ttg tcg act ggt ctt 2419

Ile Leu Pro Cys Ser Phe Ser Asp Leu Pro Ala Leu Ser Thr Gly Leu
680 685 690

ctc cac ctc cac caa aat atc gtg gac gta caa tat atg tat ggc ctg 2467

Leu His Leu His Gln Asn Ile Val Asp Val Gln Tyr Met Tyr Gly Leu
695 700 705

tca cct gcc ctc aca caa tat atc gtt cga tgg gag tgg gta gta ctc 2515

Ser Pro Ala Leu Thr Gln Tyr Ile Val Arg Trp Glu Trp Val Val Leu

tta ttc ctg ctc cta gcg gac gcc agg gtc tgc gcc tgc ttg tgg atg 2563

Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met 

730 735 740 

ctc atc ttg ctg ggc caa gcc gaa gca gca ctg gag aag ctg gtc gtc 2611

Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu Glu Lys Leu Val Val

745 750 755

ttg cac gct gcg agc gca gct agc tgc aat ggc ttc ctg tat ttt gtc 2659

Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly Phe Leu Tyr Phe Val
760 765 770

atc ttt ctc gtg gct gct tgg cac atc aag ggt agg gtg gtc ccc ttg 2707

Ile Phe Leu Val Ala Ala Trp His Ile Lys Gly Arg Val Val Pro Leu

775 780 785

gct gct tat tcc ctt act ggc ctg tgg ccg ttc tgc cta ctg ctc cta 2755

Ala Ala Tyr Ser Leu Thr Gly Leu Trp Pro Phe Cys Leu Leu Leu Leu

790 795 800 805

gca ctg ccc cag cag gct tac gcc tat gat gca tct gtg cac gga cag 2803

Ala Leu Pro Gln Gln Ala Tyr Ala Tyr Asp Ala Ser Val His Gly Gln

810 815 820

gtg ggc gcg gct ttg cta gta ctg att acc ctc ttt aca ctc acc ccg 2851

Val Gly Ala Ala Leu Leu Val Leu Ile Thr Leu Phe Thr Leu Thr Pro

825 830 835

ggg tat aag acc ctt ctc agc cag tcc ctg tgg tgg ttg tgc tat ctc 2899

Gly Tyr Lys Thr Leu Leu Ser Gln Ser Leu Trp Trp Leu Cys Tyr Leu

ctg acc ctg gcg gaa acc atg gtc cag gag tgg gca cca tcc atg cag 2947

Leu Thr Leu Ala Glu Thr Met Val Gln Glu Trp Ala Pro Ser Met Gln
855 860 865 

gcg cgc ggc ggc cgt gat ggc atc ata tgg gcc gcc acc ata ttt tgc 2995

Ala Arg Gly Gly Arg Asp Gly Ile Ile Trp Ala Ala Thr Ile Phe Cys
870 875 880 885

ccg ggc gta gtg ttt gac ata acc aag tgg ctc tta gcg gtg ctt ggg 3043

Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu Leu Ala Val Leu Gly
890 895 900

cct ggt tac ctc cta aga ggt gct ttg acg cgc gtg cca tat ttc gtc 3091

Pro Gly Tyr Leu Leu Arg Gly Ala Leu Thr Arg Val Pro Tyr Phe Val
905 910 915

aga gcc cac gct ctg ctg aga atg tgc act atg gtg agg cac ctc gcg 3139

Arg Ala His Ala Leu Leu Arg Met Cys Thr Met Val Arg His Leu Ala
920 925 930

39t agg tac gtc cag atg gcg eta tta gcc ctt ggc agg tgg act 3187 

Gly Gly Arg Tyr Val Gln Met Ala Leu Leu Ala Leu Gly Arg Trp Thr
935 940 945

ggc act tac atc tat gac cac ctc acc cct atg tcg gat tgg gct gct 3235

Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met Ser Asp Trp Ala Ala
950 955 960 965

agc ggc ctg cgg gac ttg gcg gtc gct gtg gag cct atc atc ttc agt 3283

Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Ile Ile Phe Ser

ccg atg gag aag aaa gtc atc gtt tgg gga gcg gag acg gct gcg tgc 3331

Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala Glu Thr Ala Ala Cys
985 990 995 

ggg gac atc ttg cac gga ctt ccc gtg tcc gcc cga ctc ggt cgg gag 3379

Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala Arg Leu Gly Arg Glu
1000 1005 1010

atc ctc ctt ggc cca gct gat ggc tac acc tcc aag ggg tgg aag ctt 3427

Ile Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser Lys Gly Trp Lys Leu
1015 1020 1025

ctc gcc ccc atc acc gct tac gcc cag cag aca cga ggt ctc ttg ggc 3475

Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr Arg Gly Leu Leu Gly
1030 1035 1040 1045

tct ata gtg gtg agc atg acg ggg cgt gac aag aca gaa cag gcc ggg 3523

Ser Ile Val Val Ser Met Thr Gly Arg Asp Lys Thr Glu Gln Ala Gly
1050 1055 1060

gag gtc caa gtc ctg tcc aca gtc act cag tcc ttc ctc gga aca tcc 3571

Glu Val Gln Val Leu Ser Thr Val Thr Gln Ser Phe Leu Gly Thr Ser
1065 1070 1075

att tcg ggg gtc tta tgg act gtt tac cac gga gct ggc aac aag aca 3619

Ile Ser Gly Val Leu Trp Thr Val Tyr His Gly Ala Gly Asn Lys Thr
1080 1085 1090

cta gcc ggc tcg cgg ggc ccg gtc acg cag atg tac tcg agc gcc gag 3667

Leu Ala Gly Ser Arg Gly Pro Val Thr Gln Met Tyr Ser Ser Ala Glu
1095 1100 1105

ggg gac ttg gtc ggg tgg ccc agc cct cat ggg acc aaa tct ttg gag 3715
Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu
1110 1115 1120 1125 



ccg tgt acg tgt gga gcg gtc gac ctg tat ttg gtc acg cgg aac gct 3763
Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn Ala
1130 1135 1140

gat gtc atc ccg gct cga aga cgc ggg gac aag cgg gga gcg ctg ctc 3811
Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu
1145 1150 1155

tcc ccg aga ccc ctt tcg acc ttg aag ggg tcc tcg ggg gga cct gtg 3859
Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser Ser Gly Gly Pro Val
1160 1165 1170

ctt tgc cct agg ggc cac gct gtc gga atc ttc cgg gca gct gtg tgc 3907
Leu Cys Pro Arg Gly His Ala Val Gly Ile Phe Arg Ala Ala Val Cys
1175 1180 1185

tct cgg ggt gtg gct aag tcc ata gat ttc atc ccc gtt gag acg ctc 3955
Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile Pro Val Glu Thr Leu
1190 1195 1200 1205

gac atc gtc acg cgg tct ccc acc ttt agt gac aac agc aca cca cca 4003
Asp Ile Val Thr Arg Ser Pro Thr Phe Ser Asp Asn Ser Thr Pro Pro
1210 1215 1220

gct gtg ccc cag acc tat cag gtg ggg tac ttg cac gcc ccc act ggc 4051
Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu His Ala Pro Thr Gly

agt gga aaa agc acc aag gtc ccc gtc gcg tac gcc gcc cag ggg tat 4099
Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr Ala Ala Gln Gly Tyr
1240 1245 1250 

aaa gtg ctg gtg ctc aat ccc tcg gtg gct gcc acc ctg gga ttt ggg 4147
Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly
1255 1260 1265

gcg tac ttg tcc aag gca cat ggc atc aac ccc aac att agg act gga 4195
Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro Asn Ile Arg Thr Gly
1270 1275 1280 1285

gtc aga act gtg acg acc ggg gag ccc att aca tac tcc acg tat ggt 4243
Val Arg Thr Val Thr Thr Gly Glu Pro Ile Thr Tyr Ser Thr Tyr Gly
1290 1295 1300

aaa ttc ctc gcc gat ggg ggc tgc gca ggc ggc gcc tat gac atc atc 4291
Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly Ala Tyr Asp Ile Ile
1305 1310 1315

ata tgc gat gaa tgc cac tct gtg gat gct acc act att ctc ggc atc 4339
Ile Cys Asp Glu Cys His Ser Val Asp Ala Thr Thr Ile Leu Gly Ile
1320 1325 1330

ggg aca gtc ctt gac caa gca gag aca gcc ggg gtc agg cta act gta 4387
Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly Val Arg Leu Thr Val
1335 1340 1345

ctg gcc acg gcc acg ccc ccc ggg tcg gtg aca acc ccc cat ccc aat 4435
Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asn
1350 1355 1360 1365

ata gag gag gta gcc ctc gga cag gag ggt gag atc ccc ttc tat ggg 4483
Ile Glu Glu Val Ala Leu Gly Gln Glu Gly Glu Ile Pro Phe Tyr Gly
1370 1375 1380

agg gcg ttt ccc ctg tct tac atc aag gga ggg agg cac ttg att ttc 4531
Arg Ala Phe Pro Leu Ser Tyr Ile Lys Gly Gly Arg His Leu Ile Phe
1385 1390 1395

tgc cac tca aag aaa aag tgt gac gag ctc gca acg gcc ctt cgg ggc 4579
Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Thr Ala Leu Arg Gly
1400 1405 1410

atg ggc ttg aac gct gtg gca tat tac aga ggg ttg gac gtc tcc ata 4627
Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Ile
1415 1420 1425

ata cca act caa gga gat gtg gtg gtc gtt gcc acc gac gcc ctc atg 4675
Ile Pro Thr Gln Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met
1430 1435 1440 1445

acg ggg tat act gga gac ttt gac tcc gtg atc gac tgc aac gta gcg 4723
Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Val Ala
1450 1455 1460

gtc acc cag gcc gta gac ttc agc ctg gac ccc acc ttc act ata acc 4771
Val Thr Gln Ala Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Thr
1465 1470 1475

aca cag act gtc ccg caa gac gct gtc tca cgt agt cag cgc cga ggg 4819
Thr Gln Thr Val Pro Gln Asp Ala Val Ser Arg Ser Gln Arg Arg Gly
1480 1485 1490

cgc acg ggt aga gga aga ctg ggc att tat agg tat gtt tcc act ggt 4867
Arg Thr Gly Arg Gly Arg Leu Gly Ile Tyr Arg Tyr Val Ser Thr Gly
1495 1500 1505

gag cga gcc tca gga atg ttt gac agt gta gta ctc tgt gag tgc tac 4915
Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr
1510 1515 1520 1525

gac gca gga gct gct tgg tat gag ctc tca cca gtg gag acg acc gtc 4963
Asp Ala Gly Ala Ala Trp Tyr Glu Leu Ser Pro Val Glu Thr Thr Val
1530 1535 1540

agg ctc agg gcg tat ttc aac acg cct ggc ttg cct gtg tgc cag gac 5011
Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gln Asp
1545 1550 1555

cac ctt gag ttt tgg gag gca gtt ttc acc ggc ctc aca cac ata gac 5059
His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His Ile Asp
1560 1565 1570

gct cat ttc ctt tcc cag aca aag cag tcg ggg gaa aat ttc gca tac 5107
Ala His Phe Leu Ser Gln Thr Lys Gln Ser Gly Glu Asn Phe Ala Tyr
1575 1580 1585

tta gta gcc tat cag gcc aca gtg tgc gcc agg gcc aaa gcg ccc ccc 5155
Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro
1590 1595 1600 1605

ccg tcc tgg gac gtc atg tgg aag tgc ttg act cga ctc aag ccc acg 5203
Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr Arg Leu Lys Pro Thr
1610 1615 1620

ctt gtg ggc cct aca cct ctc ctg tac cgt ttg ggc tct gtt acc aac 5251
Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ser Val Thr Asn
1625 1630 1635

gag gtc acc ctt aca cac ccc gtg aca aaa tac atc gcc aca tgc atg 5299
Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr Ile Ala Thr Cys Met
1640 1645 1650

caa gct gac ctc gag gtc atg acc agc acg tgg gtc ctg gct ggg gga 5347
Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly Gly
1655 1660 1665

gtc tta gca gcc gtc gcc gcg tat tgc tta gcg acc ggg tgt gtt tcc 5395
Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys Val Ser
1670 1675 1680 1685

atc att ggc cgt tta cac atc aac cag cga gct gtc gtc gct ccg gac 5443
Ile Ile Gly Arg Leu His Ile Asn Gln Arg Ala Val Val Ala Pro Asp
1690 1695 1700

aag gag gtc ctc tat gag gct ttt gat gag atg gag gaa tgt gcc tcc 5491
Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Cys Ala Ser
1705 1710 1715

aga gcg gct ctc ctt gaa gag ggg cag cgg ata gcc gag atg ctg aag 5539
Arg Ala Ala Leu Leu Glu Glu Gly Gln Arg Ile Ala Glu Met Leu Lys
1720 1725 1730

tcc aag atc caa ggc tta ttg cag caa gcc tct aaa cag gcc cag gac 5587
Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser Lys Gln Ala Gln Asp
1735 1740 1745

ata caa ccc gct gtg caa gct tcg tgg ccc aag atg gag caa ttc tgg 5635
Ile Gln Pro Ala Val Gln Ala Ser Trp Pro Lys Met Glu Gln Phe Trp
1750 1755 1760 1765

gcc aaa cat atg tgg aac ttc ata agc ggc att cag tac ctc gca gga 5683
Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile Gln Tyr Leu Ala Gly
1770 1775 1780

ctg tca aca ctg cca ggg aac cct gct gtg gct tcc atg atg gca ttc 5731
Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe
1785 1790 1795

agc gcc gcc ctc acc agt ccg ttg tca act agc acc acc atc ctt ctt 5779
Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr Ile Leu Leu
1800 1805 1810

aac att ctg ggg ggc tgg ctg gcg tcc caa att gcg cca ccc gcg ggg 5827
Asn Ile Leu Gly Gly Trp Leu Ala Ser Gln Ile Ala Pro Pro Ala Gly
1815 1820 1825

gcc act ggc ttt gtt gtc agt ggc ctg gtg gga gct gct gtt ggc agc 5875
Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser
1830 1835 1840 1845

ata ggc ttg ggt aaa gtg ctg gtg gac atc ctg gca ggg tat ggt gcg 5923
Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu Ala Gly Tyr Gly Ala
1850 1855 1860

ggc att tcg ggg gcc ctc gtc gcg ttt aag atc atg tct ggc gag aag 5971
Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile Met Ser Gly Glu Lys
1865 1870 1875

ccc tcc atg gag gat gtc atc aac ttg ctg cct ggg att ctg tct cca 6019
Pro Ser Met Glu Asp Val Ile Asn Leu Leu Pro Gly Ile Leu Ser Pro
1880 1885 1890

ggt gct ctg gtg gtg gga gtc atc tgc gcg gcc att ctg cgc cgc cat 6067
Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala Ile Leu Arg Arg His
1895 1900 1905

gtg gga ccg ggg gaa ggc gcg gtc caa tgg atg aac agg ctt atc gcc 6115
Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met Asn Arg Leu Ile Ala
1910 1915 1920 1925

ttc gct tcc aga gga aac cac gtc gcc cct act cac tac gtg acg gag 6163
Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr Glu
1930 1935 1940

tcg gat gcg tcg cag cgt gtc acc caa ctg ctt ggc tct ctc act ata 6211
Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu Gly Ser Leu Thr Ile
1945 1950 1955

act agt cta ctc agg aga ctt cac aac tgg atc act gag gat tgc ccc 6259
Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile Thr Glu Asp Cys Pro
1960 1965 1970

atc cca tgc gcc ggc tcg tgg ctc cgc gat gtg tgg gac tgg gtc tgt 6307
Ile Pro Cys Ala Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Val Cys
1975 1980 1985

acc atc cta aca gac ttt aag aac tgg ctg acc tcc aag ctg ttc cca 6355
Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro
1990 1995 2000 2005

aag atg cct ggc ctc ccc ttt atc tct tgc caa aag ggg tac aag ggc 6403
Lys Met Pro Gly Leu Pro Phe Ile Ser Cys Gln Lys Gly Tyr Lys Gly
2010 2015 2020

gtg tgg gcc ggc act ggc atc atg acc aca cga tgc ccc tgc ggc gcc 6451
Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg Cys Pro Cys Gly Ala
2025 2030 2035

aac atc tct ggc aac gtc cgc ttg ggc tct atg aga atc aca gga ccc 6499
Asn Ile Ser Gly Asn Val Arg Leu Gly Ser Met Arg Ile Thr Gly Pro
2040 2045 2050

aaa acc tgc atg aac acc tgg cag ggg acc ttt cct atc aat tgt tat 6547
Lys Thr Cys Met Asn Thr Trp Gln Gly Thr Phe Pro Ile Asn Cys Tyr
2055 2060 2065

aca gaa ggc cag tgc ttg ccg aaa ccc gcg tta aac ttc aag acc gcc 6595
Thr Glu Gly Gln Cys Leu Pro Lys Pro Ala Leu Asn Phe Lys Thr Ala
2070 2075 2080 2085

atc tgg aga gtg gcg gcc tca gag tac gcg gaa gtg acg cag cac gga 6643
Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gln His Gly
2090 2095 2100

tca tat gcc tat ata aca ggg ctg acc act gac aac tta aaa gtc cct 6691
Ser Tyr Ala Tyr Ile Thr Gly Leu Thr Thr Asp Asn Leu Lys Val Pro
2105 2110 2115

tgc caa ctc ccc tct cca gag ttt ttc tct tgg gtg gac gga gta caa 6739
Cys Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gln
2120 2125 2130

atc cat agg tcc gcc ccc aca cca aag ccg ttt ttc cgg gat gag gtc 6787
Ile His Arg Ser Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu Val
2135 2140 2145

tcg ttc agc gtt ggg ctc aat tca ttt gtc gtc ggg tct cag ctt ccc 6835
Ser Phe Ser Val Gly Leu Asn Ser Phe Val Val Gly Ser Gln Leu Pro
2150 2155 2160 2165

tgt gac cct gag ccc gac act gag gta gtg atg tcc atg cta aca gac 6883
Cys Asp Pro Glu Pro Asp Thr Glu Val Val Met Ser Met Leu Thr Asp
2170 2175 2180

cca tcc cat atc acg gcg gag gct gca gcg cgg cgt tta gcg cgg ggg 6931
Pro Ser His Ile Thr Ala Glu Ala Ala Ala Arg Arg Leu Ala Arg Gly
2185 2190 2195

tca ccc cca tct gag gca agc tcc tca gcg agc cag ctg tcg gcg cca 6979
Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser Gln Leu Ser Ala Pro
2200 2205 2210

tcg ctg cga gcc acc tgc acc acc cac ggt agg acc tat gat gtg gac 7027
Ser Leu Arg Ala Thr Cys Thr Thr His Gly Arg Thr Tyr Asp Val Asp
2215 2220 2225

atg gtg gat gcc aac ctg ttc atg ggg ggc ggc gtg att cgg ata gag 7075
Met Val Asp Ala Asn Leu Phe Met Gly Gly Gly Val Ile Arg Ile Glu
2230 2235 2240 2245

tct gag tcc aaa gtg gtc gtt ctg gac tcc ctc gac tca atg acc gag 7123
Ser Glu Ser Lys Val Val Val Leu Asp Ser Leu Asp Ser Met Thr Glu
2250 2255 2260

gaa gag ggc gac ctt gag cct tca gta cca tcg gag tat atg ctc ccc 7171
Glu Glu Gly Asp Leu Glu Pro Ser Val Pro Ser Glu Tyr Met Leu Pro
2265 2270 2275

agg aag agg ttc cca ccg gcc tta ccg gct tgg gcg cgg cct gat tac 7219
Arg Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr
2280 2285 2290

aac cca ccg ctt gtg gaa tcg tgg aag agg cca gat tac caa cca ccc 7267
Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro Asp Tyr Gln Pro Pro
2295 2300 2305

act gtt gcg ggc tgt gct ctc ccc ccc ccc aaa aag acc ccg acg cct 7315
Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lys Thr Pro Thr Pro
2310 2315 2320 2325

cct cca agg aga cgc cgg aca gtg ggt ctg agc gag agc acc ata gga 7363
Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Thr Ile Gly
2330 2335 2340

gat gcc ctc caa cag ctg gcc atc aag tcc ttt ggc cag ccc ccc cca 7411
Asp Ala Leu Gln Gln Leu Ala Ile Lys Ser Phe Gly Gln Pro Pro Pro
2345 2350 2355

agc ggc gat tca ggc ctt tcc acg ggg gcg gac gcc gcc gac tcc ggc 7459
Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Asp Ala Ala Asp Ser Gly
2360 2365 2370

gat cgg aca ccc cct gac gag ttg gct ctt tcg gag aca ggt tct acc 7507
Asp Arg Thr Pro Pro Asp Glu Leu Ala Leu Ser Glu Thr Gly Ser Thr
2375 2380 2385

tcc tcc atg ccc ccc ctc gag ggg gag cct ggg gac cca gac ctg gag 7555
Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Glu
2390 2395 2400 2405

cct gag cag gta gag ctt caa cct cct ccc cag ggg ggg gag gca gct 7603
Pro Glu Gln Val Glu Leu Gln Pro Pro Pro Gln Gly Gly Glu Ala Ala
2410 2415 2420

ccc ggc tcg gac tcg ggg tcc tgg tct act tgc tcc gag gag gat gac 7651
Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys Ser Glu Glu Asp Asp
2425 2430 2435

tcc gtc gtg tgc tgc tcc atg tca tat tcc tgg acc ggg gct cta ata 7699
Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu Ile
2440 2445 2450

act cct tgt agc ccc gaa gag gaa aag ttg cca att aac tcc ttg agc 7747
Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro Ile Asn Ser Leu Ser
2455 2460 2465

aac tcg ctg ttg cga tac cat aac aag gta tac tgt act aca tca aag 7795
Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr Cys Thr Thr Ser Lys
2470 2475 2480 2485

agt gcc tca cta agg gct aaa aag gta act ttt gat agg atg caa gtg 7843
Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe Asp Arg Met Gln Val
2490 2495 2500

ctc gac gcc tat tat gat tca gtc tta aag gac atc aag cta gcg gcc 7891
Leu Asp Ala Tyr Tyr Asp Ser Val Leu Lys Asp Ile Lys Leu Ala Ala
2505 2510 2515

tcc aag gtc agc gca agg ctc ctc acc tta gag gag gcg tgc caa ttg 7939
Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gln Leu
2520 2525 2530

acc cca ccc cac tct gca aga tcc aag tat ggg ttt ggg gct aag gag 7987
Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu
2535 2540 2545

gtc cgc agc ttg tcc ggg agg gcc gtc aac cac atc aag tcc gtg tgg 8035
Val Arg Ser Leu Ser Gly Arg Ala Val Asn His Ile Lys Ser Val Trp
2550 2555 2560 2565

aag gac ctc ttg gaa gac tca caa aca cca att cct aca acc atc atg 8083
Lys Asp Leu Leu Glu Asp Ser Gln Thr Pro Ile Pro Thr Thr Ile Met
2570 2575 2580

gcc aaa aat gag gtg ttc tgc gtg gac ccc gcc aag ggg ggt aaa aaa 8131
Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala Lys Gly Gly Lys Lys
2585 2590 2595

cca gct cgc ctt atc gtt tac cct gac ctc ggc gtc agg gtc tgc gag 8179
Pro Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu
2600 2605 2610

aag atg gcc ctt tat gat gtc aca caa aag ctt cct cag gcg gtg atg 8227
Lys Met Ala Leu Tyr Asp Val Thr Gln Lys Leu Pro Gln Ala Val Met
2615 2620 2625

ggg gct tct tat ggc ttc cag tac tcc ccc gct cag cgg gtg gag ttt 8275
Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala Gln Arg Val Glu Phe
2630 2635 2640 2645

ctc ttg aag gca tgg gcg gaa aag aga gac cct atg ggt ttt tcg tat 8323
Leu Leu Lys Ala Trp Ala Glu Lys Arg Asp Pro Met Gly Phe Ser Tyr
2650 2655 2660

gat acc cga tgc ttt gac tca acc gtc act gag aga gac atc agg act 8371
Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg Asp Ile Arg Thr
2665 2670 2675

gag gag tcc ata tac cag gcc tgc tcc tta ccc gag gag gcc cga act 8419
Glu Glu Ser Ile Tyr Gln Ala Cys Ser Leu Pro Glu Glu Ala Arg Thr
2680 2685 2690

gcc ata cac tcg ctg act gag aga ctc tat gtg gga ggg ccc atg ttc 8467
Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Phe
2695 2700 2705

aac agc aag ggc cag tcc tgc ggg tac agg cgt tgc cgc gcc agc ggg 8515
Asn Ser Lys Gly Gln Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly
2710 2715 2720 2725

gtg ctt acc act agt atg ggg aac acc atc aca tgc tat gta aaa gcc 8563
Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr Cys Tyr Val Lys Ala
2730 2735 2740

cta gcg gct tgc aag gct gcg ggg ata att gcg ccc acg atg ctg gta 8611
Leu Ala Ala Cys Lys Ala Ala Gly Ile Ile Ala Pro Thr Met Leu Val
2745 2750 2755

tgc ggc gac gac ttg gtc gtc atc tca gaa agc cag ggg act gag gag 8659
Cys Gly Asp Asp Leu Val Val Ile Ser Glu Ser Gln Gly Thr Glu Glu
2760 2765 2770

gac gag cgg aac ctg aga gcc ttc acg gag gct atg acc agg tat tct 8707
Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser
2775 2780 2785

gcc cct cct ggt gac ccc ccc aga ccg gaa tat gac ctg gag cta ata 8755
Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu Ile
2790 2795 2800 2805

aca tct tgt tcc tca aac gtg tct gtg gca ctt ggc cca cag ggc cgc 8803
Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu Gly Pro Gln Gly Arg
2810 2815 2820

cgc aga tac tac ctg acc aga gac ccc acc act tca att gcc cgg gct 8851
Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Ser Ile Ala Arg Ala
2825 2830 2835

gcc tgg gaa aca gtt aga cac tcc cct gtc aat tca tgg ctg gga aac 8899
Ala Trp Glu Thr Val Arg His Ser Pro Val Asn Ser Trp Leu Gly Asn
2840 2845 2850

atc atc cag tac gct cca acc ata tgg gtt cgc atg gtc ctg atg aca 8947
Ile Ile Gln Tyr Ala Pro Thr Ile Trp Val Arg Met Val Leu Met Thr
2855 2860 2865

cac ttc ttc tcc att ctc atg gcc cag gac acc cta gac cag aac ctt 8995
His Phe Phe Ser Ile Leu Met Ala Gln Asp Thr Leu Asp Gln Asn Leu
2870 2875 2880 2885

aac ttt gaa atg tac gga tcg gtg tac tcc gtg agt cct ctg gac ctc 9043
Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val Ser Pro Leu Asp Leu
2890 2895 2900

cca gcc ata att gaa agg tta cac ggg ctt gac gcc ttc tct ctg cac 9091
Pro Ala Ile Ile Glu Arg Leu His Gly Leu Asp Ala Phe Ser Leu His
2905 2910 2915

aca tac act ccc cac gaa ctg acg cgg gtg gct tca gcc ctc aga aaa 9139
Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala Ser Ala Leu Arg Lys
2920 2925 2930

ctt ggg gcg cca ccc ctc aga gcg tgg aag agt cgg gcg cgt gca gtt 9187
Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser Arg Ala Arg Ala Val
2935 2940 2945

agg gcg tcc ctc atc tcc cgt ggg ggg agg gcg gcc gtt tgc ggt cgg 9235
Arg Ala Ser Leu Ile Ser Arg Gly Gly Arg Ala Ala Val Cys Gly Arg
2950 2955 2960 2965

tac ctc ttc aac tgg gcg gtg aag acc aag ctc aaa ctc act cct ttg 9283
Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu
2970 2975 2980

ccg gag gca cgc ctc ctg gat ttg tcc agt tgg ttt acc gtc ggc gcc 9331
Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp Phe Thr Val Gly Ala
2985 2990 2995

ggc ggg ggc gac att tat cac agc gtg tcg cgt gcc cga ccc cgc cta 9379
Gly Gly Gly Asp Ile Tyr His Ser Val Ser Arg Ala Arg Pro Arg Leu
3000 3005 3010

tta ctc ctt agc cta ctc cta ctt tct gta ggg gta ggc ctc ttc cta 9427
Leu Leu Leu Ser Leu Leu Leu Leu Ser Val Gly Val Gly Leu Phe Leu
3015 3020 3025

ctc ccc gct cga tag agcggcacac attagctaca ctccatagct aactgttcct 9482
Leu Pro Ala Arg
3030

tttttttttt tttttttttt tttttttttt tttttttctt tttttttttt tttccctctt 9542
tcttcccttc tcatcttatt ctactttctt tcttggtggc tccatcttag ccctggtcac 9602
ggctagctgt gaaaggtccg tgagccgcat gactgcagag agtgccgtaa ctggtctctc 9662
tgcagatcat gt 9674

<210> 6 
<211> 3033 
<212> PRT 

<213> Hepatitis C virus
<400> 6 

Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
35 40 45

Thr Arg Lys Ala Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60

Ile Pro Lys His Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly
65 70 75 80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp
85 90 95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro
100 105 110

Arg His Arg Ser Arg Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125

Gly Phe Ala Asp Leu Leu Gly Tyr Val Pro Val Val Gly Ala Pro Leu
130 135 140

Ser Gly Val Ala Ser Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
165 170 175

Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr Thr Pro Val Ser Ala Val
180 185 190

Gln Val Lys Asn Thr Ser Asn Ala Tyr Met Ala Thr Asn Asp Cys Ser
195 200 205

Asn Asp Ser Ile Thr Trp Gln Leu Glu Ala Ala Val Leu His Val Pro
210 215 220

Gly Cys Val Pro Cys Glu Lys Met Gly Asn Thr Ser Arg Cys Trp Ile
225 230 235 240

Pro Val Ser Pro Asn Val Ala Val Arg Gln Pro Gly Ala Leu Thr Arg
245 250 255

Gly Leu Arg Thr His Ile Asp Met Val Val Leu Ser Ala Thr Leu Cys
260 265 270

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ser
275 280 285

Gln Met Phe Ile Val Ser Pro Gln His His Trp Phe Val Gln Glu Cys
290 295 300

Asn Cys Ser Ile Tyr Pro Gly Ala Ile Thr Gly His Arg Met Ala Trp
305 310 315 320

Asp Met Met Met Asn Trp Ser Pro Thr Thr Thr Met Ile Leu Ala Tyr
325 330 335

Val Met Arg Val Pro Glu Val Ile Ile Asp Ile Ile Ser Gly Ala His
340 345 350

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp
355 360 365

Ala Lys Val Val Val Ile Leu Leu Leu Ala Ser Gly Val Asp Ala Tyr
370 375 380

Thr Thr Thr Thr Gly Ser Ala Ala Gly Arg Thr Thr Ser Ser Leu Ala
385 390 395 400

Ser Ala Phe Ser Pro Gly Ala Arg Gln Asn Ile Gln Leu Ile Asn Thr
405 410 415

Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser
420 425 430

Leu His Thr Gly Phe Phe Thr Ala Leu Phe Tyr Ile His Lys Phe Asn
435 440 445

Ser Ser Gly Cys Pro Glu Arg Leu Ser Ala Cys Arg Asn Ile Glu Asp
450 455 460

Phe Arg Ile Gly Trp Gly Ala Leu Gln Tyr Asp Asp Asn Val Thr Asn
465 470 475 480

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Lys Gln Cys
485 490 495

Gly Val Val Pro Ala Gly Thr Val Cys Gly Pro Val Tyr Cys Phe Thr
500 505 510

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Val Pro Thr
515 520 525

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr
530 535 540

Arg Pro Pro Ser Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr
545 550 555 560

Gly Phe Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp
565 570 575

Phe Asn Thr Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys
580 585 590

His Pro Glu Ala Thr Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr
595 600 605

Pro Lys Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys
610 615 620

Thr Val Asn Tyr Ser Thr Phe Lys Ile Arg Met Tyr Val Gly Gly Val
625 630 635 640

Glu His Arg Leu Met Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys
645 650 655

Asn Leu Glu Asp Arg Asp Arg Ser Gln Gln Thr Pro Leu Leu His Ser
660 665 670

Thr Thr Glu Trp Ala Ile Leu Pro Cys Ser Phe Ser Asp Leu Pro Ala
675 680 685

Leu Ser Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln
690 695 700

Tyr Met Tyr Gly Leu Ser Pro Ala Leu Thr Gln Tyr Ile Val Arg Trp
705 710 715 720

Glu Trp Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys
725 730 735

Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu
740 745 750

Glu Lys Leu Val Val Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly
755 760 765

Phe Leu Tyr Phe Val Ile Phe Leu Val Ala Ala Trp His Ile Lys Gly
770 775 780

Arg Val Val Pro Leu Ala Ala Tyr Ser Leu Thr Gly Leu Trp Pro Phe
785 790 795 800

Cys Leu Leu Leu Leu Ala Leu Pro Gln Gln Ala Tyr Ala Tyr Asp Ala
805 810 815

Ser Val His Gly Gln Val Gly Ala Ala Leu Leu Val Leu Ile Thr Leu
820 825 830

Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Ser Gln Ser Leu Trp
835 840 845

Trp Leu Cys Tyr Leu Leu Thr Leu Ala Glu Thr Met Val Gln Glu Trp
850 855 860

Ala Pro Ser Met Gln Ala Arg Gly Gly Arg Asp Gly Ile Ile Trp Ala
865 870 875 880

Ala Thr Ile Phe Cys Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu
885 890 895

Leu Ala Val Leu Gly Pro Gly Tyr Leu Leu Arg Gly Ala Leu Thr Arg
900 905 910

Val Pro Tyr Phe Val Arg Ala His Ala Leu Leu Arg Met Cys Thr Met
915 920 925

Val Arg His Leu Ala Gly Gly Arg Tyr Val Gln Met Ala Leu Leu Ala
930 935 940

Leu Gly Arg Trp Thr Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met
945 950 955 960

Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu
965 970 975

Pro Ile Ile Phe Ser Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala
980 985 990

Glu Thr Ala Ala Cys Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala
995 1000 1005

Arg Leu Gly Arg Glu Ile Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser
1010 1015 1020

Lys Gly Trp Lys Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr
1025 1030 1035 1040

Arg Gly Leu Leu Gly Ser Ile Val Val Ser Met Thr Gly Arg Asp Lys
1045 1050 1055

Thr Glu Gln Ala Gly Glu Val Gln Val Leu Ser Thr Val Thr Gln Ser
1060 1065 1070

Phe Leu Gly Thr Ser Ile Ser Gly Val Leu Trp Thr Val Tyr His Gly
1075 1080 1085

Ala Gly Asn Lys Thr Leu Ala Gly Ser Arg Gly Pro Val Thr Gln Met
1090 1095 1100

Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly
1105 1110 1115 1120

Thr Lys Ser Leu Glu Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu
1125 1130 1135

Val Thr Arg Asn Ala Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys
1140 1145 1150

Arg Gly Ala Leu Leu Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser
1155 1160 1165

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Ala Val Gly Ile Phe
1170 1175 1180

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile
1185 1190 1195 1200

Pro Val Glu Thr Leu Asp Ile Val Thr Arg Ser Pro Thr Phe Ser Asp
1205 1210 1215

Asn Ser Thr Pro Pro Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu
1220 1225 1230

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr
1235 1240 1245

Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala
1250 1255 1260

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro
1265 1270 1275 1280

Asn Ile Arg Thr Gly Val Arg Thr Val Thr Thr Gly Glu Pro Ile Thr
1285 1290 1295

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly
1300 1305 1310

Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Val Asp Ala Thr
1315 1320 1325

Thr Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr Ala Gly
1330 1335 1340

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr
1345 1350 1355 1360

Thr Pro His Pro Asn Ile Glu Glu Val Ala Leu Gly Gln Glu Gly Glu
1365 1370 1375

Ile Pro Phe Tyr Gly Arg Ala Phe Pro Leu Ser Tyr Ile Lys Gly Gly
1380 1385 1390

Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala
1395 1400 1405

Thr Ala Leu Arg Gly Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly
1410 1415 1420

Leu Asp Val Ser Ile Ile Pro Thr Gln Gly Asp Val Val Val Val Ala
1425 1430 1435 1440

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile
1445 1450 1455

Asp Cys Asn Val Ala Val Thr Gln Ala Val Asp Phe Ser Leu Asp Pro
1460 1465 1470

Thr Phe Thr Ile Thr Thr Gln Thr Val Pro Gln Asp Ala Val Ser Arg
1475 1480 1485

Ser Gln Arg Arg Gly Arg Thr Gly Arg Gly Arg Leu Gly Ile Tyr Arg
1490 1495 1500

Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val
1505 1510 1515 1520

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Glu Leu Ser Pro
1525 1530 1535

Val Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu
1540 1545 1550

Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly
1555 1560 1565

Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln Ser Gly
1570 1575 1580

Glu Asn Phe Ala Tyr Leu Val Ala Tyr Gln Ala Thr Val Cys Ala Arg
1585 1590 1595 1600

Ala Lys Ala Pro Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr
1605 1610 1615

Arg Leu Lys Pro Thr Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu
1620 1625 1630

Gly Ser Val Thr Asn Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr
1635 1640 1645

Ile Ala Thr Cys Met Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp
1650 1655 1660

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala
1665 1670 1675 1680

Thr Gly Cys Val Ser Ile Ile Gly Arg Leu His Ile Asn Gln Arg Ala
1685 1690 1695

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met
1700 1705 1710

Glu Glu Cys Ala Ser Arg Ala Ala Leu Leu Glu Glu Gly Gln Arg Ile
1715 1720 1725

Ala Glu Met Leu Lys Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser
1730 1735 1740

Lys Gln Ala Gln Asp Ile Gln Pro Ala Val Gln Ala Ser Trp Pro Lys
1745 1750 1755 1760

Met Glu Gln Phe Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly Ile
1765 1770 1775

Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala
1780 1785 1790

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser
1795 1800 1805

Thr Thr Ile Leu Leu Asn Ile Leu Gly Gly Trp Leu Ala Ser Gln Ile
1810 1815 1820

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly
1825 1830 1835 1840

Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu
1845 1850 1855

Ala Gly Tyr Gly Ala Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile
1860 1865 1870

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val Ile Asn Leu Leu Pro
1875 1880 1885

Gly Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala
1890 1895 1900

Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met
1905 1910 1915 1920

Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr
1925 1930 1935

His Tyr Val Thr Glu Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu
1940 1945 1950

Gly Ser Leu Thr Ile Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile
1955 1960 1965

Thr Glu Asp Cys Pro Ile Pro Cys Ala Gly Ser Trp Leu Arg Asp Val
1970 1975 1980

Trp Asp Trp Val Cys Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr
1985 1990 1995 2000

Ser Lys Leu Phe Pro Lys Met Pro Gly Leu Pro Phe Ile Ser Cys Gln
2005 2010 2015

Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg
2020 2025 2030

Cys Pro Cys Gly Ala Asn Ile Ser Gly Asn Val Arg Leu Gly Ser Met
2035 2040 2045

Arg Ile Thr Gly Pro Lys Thr Cys Met Asn Thr Trp Gln Gly Thr Phe
2050 2055 2060

Pro Ile Asn Cys Tyr Thr Glu Gly Gln Cys Leu Pro Lys Pro Ala Leu
2065 2070 2075 2080

Asn Phe Lys Thr Ala Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu
2085 2090 2095

Val Thr Gln His Gly Ser Tyr Ala Tyr Ile Thr Gly Leu Thr Thr Asp
2100 2105 2110

Asn Leu Lys Val Pro Cys Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp
2115 2120 2125

Val Asp Gly Val Gln Ile His Arg Ser Ala Pro Thr Pro Lys Pro Phe
2130 2135 2140

Phe Arg Asp Glu Val Ser Phe Ser Val Gly Leu Asn Ser Phe Val Val
2145 2150 2155 2160

Gly Ser Gln Leu Pro Cys Asp Pro Glu Pro Asp Thr Glu Val Val Met
2165 2170 2175

Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala Glu Ala Ala Ala Arg
2180 2185 2190

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser
2195 2200 2205

Gln Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Gly Arg
2210 2215 2220

Thr Tyr Asp Val Asp Met Val Asp Ala Asn Leu Phe Met Gly Gly Gly
2225 2230 2235 2240

Val Ile Arg Ile Glu Ser Glu Ser Lys Val Val Val Leu Asp Ser Leu
2245 2250 2255

Asp Ser Met Thr Glu Glu Glu Gly Asp Leu Glu Pro Ser Val Pro Ser
2260 2265 2270

Glu Tyr Met Leu Pro Arg Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp
2275 2280 2285

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro
2290 2295 2300

Asp Tyr Gln Pro Pro Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Lys
2305 2310 2315 2320

Lys Thr Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser
2325 2330 2335

Glu Ser Thr Ile Gly Asp Ala Leu Gln Gln Leu Ala Ile Lys Ser Phe
2340 2345 2350

Gly Gln Pro Pro Pro Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Asp
2355 2360 2365

Ala Ala Asp Ser Gly Asp Arg Thr Pro Pro Asp Glu Leu Ala Leu Ser
2370 2375 2380

Glu Thr Gly Ser Thr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly
2385 2390 2395 2400

Asp Pro Asp Leu Glu Pro Glu Gin Val Glu Leu Gin Pro Pro Pro Gin 

2405 2410 2415 

Gly Gly Glu Ala Ala Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys 

2420 2425 2430 

Ser Glu Glu Asp Asp Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp 

2435 2440 2445 

Thr Gly Ala Leu Ile Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro 

2450 2455 2460 

Ile Asn Ser Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr 
2465 2470 2475 2480 

Cys Thr Thr Ser Lys Ser Ala Ser Leu Arg Ala Lys Lys Val Thr Phe 

2485 2490 2495 

Asp Arg Met Gln Val Leu Asp Ala Tyr Tyr Asp Ser Val Leu Lys Asp 

2500 2505 2510 

Ile Lys Leu Ala Ala Ser Lys Val Ser Ala Arg Leu Leu Thr Leu Glu 

2515 2520 2525 

Glu Ala Cys Gln Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly 

2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 2550 2555 2560 

Ile Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Gln Thr Pro Ile 

2565 2570 2575 

Pro Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Ala 

2580 2585 2590 

Lys Gly Gly Lys Lys Pro Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly 

2595 2600 2605 

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Thr Gln Lys Leu 

2610 2615 2620 

Pro Gln Ala Val Met Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala 
2625 2630 2635 2640 

Gln Arg Val Glu Phe Leu Leu Lys Ala Trp Ala Glu Lys Arg Asp Pro 
2645 2650 2655 

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 

2660 2665 2670 

Arg Asp Ile Arg Thr Glu Glu Ser Ile Tyr Gln Ala Cys Ser Leu Pro 

2675 2680 2685 

Glu Glu Ala Arg Thr Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val 

2690 2695 2700 

Gly Gly Pro Met Phe Asn Ser Lys Gly Gln Ser Cys Gly Tyr Arg Arg 
2705 2710 2715 2720 

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr 

2725 2730 2735 

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly Ile Ile Ala 

2740 2745 2750 

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Ser Glu Ser 

2755 2760 2765 

Gln Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala 

2770 2775 2780 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 
2785 2790 2795 2800 

Asp Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu 

2805 2810 2815 

Gly Pro Gln Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 

2820 2825 2830 

Ser Ile Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Val Asn 

2835 2840 2845 

Ser Trp Leu Gly Asn Ile Ile Gln Tyr Ala Pro Thr Ile Trp Val Arg 

2850 2855 2860 

Met Val Leu Met Thr His Phe Phe Ser Ile Leu Met Ala Gln Asp Thr 
2865 2870 2875 2880 

Leu Asp Gln Asn Leu Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val 

2885 2890 2895 

Ser Pro Leu Asp Leu Pro Ala Ile Ile Glu Arg Leu His Gly Leu Asp 
2900 2905 2910 

Ala Phe Ser Leu His Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala 

2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser 

2930 2935 2940 

Arg Ala Arg Ala Val Arg Ala Ser Leu Ile Ser Arg Gly Gly Arg Ala 
2945 2950 2955 2960 

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu 

2965 2970 2975 

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 

2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp Ile Tyr His Ser Val Ser Arg 

2995 3000 3005 

Ala Arg Pro Arg Leu Leu Leu Leu Ser Leu Leu Leu Leu Ser Val Gly 

3010 3015 3020 

Val Gly Leu Phe Leu Leu Pro Ala Arg 
3025 3030 



<210> 7 
<211> 8024 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
<400> 7 

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
cgcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa auccuaaacc 360 
ucaaagaaaa accaaaagaa acaccaaccg ucgcccaaug auugaacaag auggauugca 420 
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 540 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 840 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200 
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260 
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgaugau accauggcuc ccaucacugc uuaugcccag 1800 
caaacacgag gccuccuggg cgccauagug gugaguauga cggggcguga caggacagaa 1860 
caggccgggg aaguccaaau ccuguccaca gucucucagu ccuuccucgg aacaaccauc 1920 
ucggggguuu uguggacugu uuaccacgga gcuggcaaca agacucuagc cggcuuacgg 1980 
gguccgguca cgcagaugua cucgagugcu gagggggacu ugguaggcug gcccagcccc 2040 
ccugggacca agucuuugga gccgugcaag uguggagccg ucgaccuaua ucuggucacg 2100 
cggaacgcug augucauccc ggcucggaga cgcggggaca agcggggagc auugcucucc 2160 
ccgagaccca uuucgaccuu gaaggggucc ucgggggggc cggugcucug cccuaggggc 2220 
cacgucguug ggcucuuccg agcagcugug ugcucucggg gcguggccaa auccaucgau 2280 
uucauccccg uugagacacu cgacguuguu acaaggucuc ccacuuucag ugacaacagc 2340 
acgccaccgg cugugcccca gaccuaucag gucggguacu ugcaugcucc aacuggcagu 2400 
ggaaagagca ccaagguccc ugucgcguau gccgcccagg gguacaaagu acuagugcuu 2460 
aaccccucgg uagcugccac ccugggguuu ggggcguacc uauccaaggc acauggcauc 2520 
aaucccaaca uuaggacugg agucaggacc gugaugaccg gggaggccau cacguacucc 2580 
acauauggca aauuucucgc cgaugggggc ugcgcuagcg gcgccuauga caucaucaua 2640 
ugcgaugaau gccacgcugu ggaugcuacc uccauucucg gcaucggaac gguccuugau 2700 
caagcagaga cagccggggu cagacuaacu gugcuggcua cggccacacc ccccggguca 2760 
gugacaaccc cccaucccga uauagaaaag guaggccucg ggcgggaggg ugagaucccc 2820 
uucuauggga gggcgauucc ccuauccugc aucaagggag ggagacaccu gauuuucugc 2880 
cacucaaaga aaaaguguga cgagcucgcg gcggcccuuc ggggcauggg cuugaaugcc 2940 
guggcauacu auagaggguu ggacgucucc auaauaccag cucagggaga uguggugguc 3000 
gucgccaccg acgcccucau gacgggguac acuggagacu uugacuccgu gaucgacugc 3060 
aauguagcgg ucacccaagc ugucgacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cacaagacgc ugucucacgc agucagcgcc gcgggcgcac agguagagga 3180 
agacagggca cuuauaggua uguuuccacu ggugaacgag ccucaggaau guuugacagu 3240 
guagugcuuu gugagugcua cgacgcaggg gcugcguggu acgaucucac accagcggag 3300 
accaccguca ggcuuagagc guauuucaac acgcccggcc uacccgugug ucaagaccau 3360 



cuugaauuuu gggaggcagu uuucaccggc 



3420 



caaacaaagc aagcggggga gaacuucgcg uaccuaguag ccuaccaagc uacggugugc 3480 
gccagagcca aggccccucc cccguccugg gacgccaugu ggaagugccu ggcccgacuc 3540 
aagccuacgc uugcgggccc cacaccucuc cuguaccguu ugggcccuau uaccaaugag 3600 
gucacccuca cacacccugg gacgaaguac aucgccacau gcaugcaagc ugaccuugag 3660 
gucaugacca gcacgugggu ccuagcugga ggaguccugg cagccgucgc cgcauauugc 3720 
cuggcgacug gaugcguuuc caucaucggc cgcuugcacg ucaaccagcg agucgucguu 3780 
gcgccggaua aggagguccu guaugaggcu uuugaugaga uggaggaaug cgccucuagg 3840 
gcggcucuca ucgaagaggg gcagcggaua gccgagaugu ugaaguccaa gauccaaggc 3900 
uugcugcagc aggccucuaa gcaggcccag gacauacaac ccgcuaugca ggcuucaugg 3960 
cccaaagugg aacaauuuug ggccagacac auguggaacu ucauuagcgg cauccaauac 4020 
cucgcaggau ugucaacacu gccagggaac cccgcggugg cuuccaugau ggcauucagu 4080 
gccgcccuca ccaguccguu gucgaccagu accaccaucc uucucaacau caugggaggc 4140 
ugguuagcgu cccagaucgc accacccgcg ggggccaccg gcuuugucgu caguggccug 4200 
gugggggcug ccgugggcag cauaggccug gguaaggugc ugguggacau ccuggcagga 4260 
uauggugcgg gcauuucggg ggcccucguc gcauucaaga ucaugucugg cgagaagccc 4320 
ucuauggaag augucaucaa ucuacugccu gggauccugu cuccgggagc ccugguggug 4380 
ggggucaucu gcgcggccau ucugcgccgc cacgugggac cgggggaggg cgcgguccaa 4440 
uggaugaaca ggcuuauugc cuuugcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gugacggagu cggaugcguc gcagcgugug acccaacuac uuggcucucu uacuauaacc 4560 
agccuacuca gaagacucca caauuggaua acugaggacu gccccauccc augcuccgga 4620 
uccuggcucc gcgacgugug ggacuggguu ugcaccaucu ugacagacuu caaaaauugg 4680 
cugaccucua aauuguuccc caagcugccc ggccucaccu ucaucucuug ucaaaagggg 4740 
uacaagggug ugugggccgg cacuggcauc augaccacgc gcugcccuug cggcgccaac 4800 
aucucuggca auguccgccu gggcucuaug aggaucacag ggccuaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugc uacacggagg gccagugcgc gccgaaaccc 4920 
cccacgaacu acaagaccgc caucuggagg guggcggccu cggaguacgc ggaggugacg 4980 
cagcaugggu cguacuccua uguaacagga cugaccacug acaaucugaa aauuccuugc 5040 
caacuaccuu cuccagaguu uuucuccugg guggacggug ugcagaucca uagguuugca 5100 
cccacaccaa agccguuuuu ccgggaugag gucucguucu gcguugggcu uaauuccuau 5160 
gcugucgggu cccagcuucc cugugaaccu gagcccgacg cagacguauu gagguccaug 5220 
cuaacagauc cgccccacau cacggcggag acugcggcgc ggcgcuuggc acggggauca 5280 
ccuccaucug aggcgagcuc cucagugagc cagcuaucag caccgucgcu gcgggccacc 5340 
ugcaccaccc acagcaacac cuaugacgug gacauggucg augccaaccu gcucauggag 5400 
ggcggugugg cucagacaga gccugagucc agggugcccg uucuggacuu ucucgagcca 5460 
auggccgagg aagagagcga ccuugagccc ucaauaccau cggagugcau gcuccccagg 5520 
agcggguuuc cacgggccuu accggcuugg gcacggccug acuacaaccc gccgcucgug 5580 
gaaucgugga ggaggccaga uuaccaaccg cccaccguug cugguugugc ucuccccccc 5640 
cccaagaagg ccccgacgcc ucccccaagg agacgccgga cagugggucu gagcgagagc 5700 
accauaucag aagcccucca gcaacuggcc aucaagaccu uuggccagcc ccccucgagc 5760 
ggugaugcag gcucguccac gggggcgggc gccgccgaau ccggcggucc gacguccccu 5820 
ggugagccgg cccccucaga gacagguucc gccuccucua ugcccccccu cgagggggag 5880 
ccuggagauc cggaccugga gucugaucag guagagcuuc aaccuccccc ccaggggggg 5940 
gggguagcuc 
accgugugcu 
gaagaggaaa 
guguacugua 
acgcaagugc 
aaggucagcg 
gcaagaucca 



ccgguucggg 
gcuccauguc 
aguugccaau 



accaucaugg 
gcucgocuca 
gacauuacac 
ccugcccaac 
uuuucguaug 
gaguccauau 
acugagagac 
agacguugcc 
gugaaagccc 
ggcaaugacc 
agagccuuca 
gaauaugacc 
cggggccgcc 
ugggaaacag 

gacacccugg 
uuggaccuuc 

cucagggugu 
aaagcggccg 

gggggcgaca 



wcgacgccca 
caaggcuccu 
aguauggauu 
aguccgugug 
ccaaaaauga 
ucguuuaccc 
aaaagcuucc 
ggguggagua 
auacccgaug 
accaggccug 
uuuacguagg 
gcgccagcgg 
uagcggccug 
uaguagucau 
cggaggccau 
uggagcuaau 
gcagauacua 
uuagacacuc 
ggguucgcau 
accagaaccw 

acgaacugac 
ggaagagucg 
uuugcggccg 
cggaggcgcg 



cucggggucu 
auacuccugg 
caacccuuug 
gagcgccuca 
uuaugacuca 
caccuuggag 
csfsgsccaag 
gaaggaccuc 
gguguucugc 
ugaccucggc 
ucaggcggua 
ucucuugaaa 
cuucgacuca 
cucccugccc 
agggcccaug 
ggugcuaacc 
caaggcugcg 
cucagaaagc 
gaccagguac 
aacauccugu 
ocugaccaga 
cccuaucaau 
gguccuaaug 
caacuuugag 
ugagagguua 
gcggguggc\i 
ggcucgcgca 



; ucguaggggu 
ccauagcuaa 



ccuacuggac 
cgugucgcgc 
aggccucuuc 
cuguuccuuu 



uggucuacuu 
accggggcuc 
aguaacucgc 
cagagggcua 
gucuuaaagg 
gaggcgugcc 
gagguccgca 
cuggaagacc 
guggaccccg 
guccgggucu 
augggagcuu 
gcaugggcgg 
accgucacug 
gaggaggccc 
uucaacagca 
acuagcaugg 
gggauaguug 
caggggacug 
ucugccccuc 
uccucaaaug 
gacccaacca 
ucauggcugg 

auguauggau 
cacgggcuug 
ucagcccuca 
gucagggcgu 
aauugggcgg 
uuauccaguu 



gcuccgagga 
uaauaacucc 
uguugcgaua 
aaaagguaac 
acaucaagcu 
aguugacucc 
gcuuguccgg 
cacaaacacc 
ccaagggggg 
gcgagaaaau 
ccuauggcuu 
aaaagaagga 
agagagacau 
gcacugccau 
agggucaaac 
guaacaccau 
cgcccacaau 
aggaggacga 
cuggugaucc 
ugucuguggc 



ggacgauacc 
cuguagcccc 
ccauaacaag 
uuuugacagg 
agcggcuucc 
accccauucu 
gagggccguu 
aauucccaca 
uaagaaacca 
ggcccucuau 
ccaguacucc 
ccccaugggu 
caggaccgag 
acacucgcug 
cugcgguuac 
cacaugcuau 
gcugguaugc 
gcggaaccug 

guugggcccg 
ccgggcugcc 
ccaguaugcu 
caiigguccaa 
cgugaauccu 



ugaagaccaa gcucaaacuc 
gguucaccgu cggcgccggc 
gcucauuacu cuucggccua 
c\icgguagag cggcacacac 



6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
uuuuuuuuuu cuuuuuuuuu uuuuucccuc uuucuucccu ucucaucuua uucuacuuuc 7920 
uuucuuggug gcuccaucuu agcccuaguc acggcuagcu gugaaagguc cgugagccgc 7980 
augacugcag agagugccgu aacuggucuc ucugcagauc augu 8024 



<210> 8 
<211> 7994 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: replicon 
<400> 8 

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
cgcuugcgag ugccccggga ggucucguag accgugcacc augagcacaa auccuaaacc 360 
ucaaagaaaa accaaaagaa acaccaaccg ucgcccaaug auugaacaag auggauugca 420 
cgcagguucu ccggccgcuu ggguggagag gcuauucggc uaugacuggg cacaacagac 480 
aaucggcugc ucugaugccg ccguguuccg gcugucagcg caggggcgcc cgguucuuuu 540 
ugucaagacc gaccuguccg gugcccugaa ugaacugcag gacgaggcag cgcggcuauc 600 
guggcuggcc acgacgggcg uuccuugcgc agcugugcuc gacguuguca cugaagcggg 660 
aagggacugg cugcuauugg gcgaagugcc ggggcaggau cuccugucau cucaccuugc 720 
uccugccgag aaaguaucca ucauggcuga ugcaaugcgg cggcugcaua cgcuugaucc 780 
ggcuaccugc ccauucgacc accaagcgaa acaucgcauc gagcgagcac guacucggau 840 
ggaagccggu cuugucgauc aggaugaucu ggacgaagag caucaggggc ucgcgccagc 900 
cgaacuguuc gccaggcuca aggcgcgcau gcccgacggc gaggaucucg ucgugaccca 960 
uggcgaugcc ugcuugccga auaucauggu ggaaaauggc cgcuuuucug gauucaucga 1020 
cuguggccgg cugggugugg cggaccgcua ucaggacaua gcguuggcua cccgugauau 1080 
ugcugaagag cuuggcggcg aaugggcuga ccgcuuccuc gugcuuuacg guaucgccgc 1140 
ucccgauucg cagcgcaucg ccuucuaucg ccuucuugac gaguucuucu gaguuuaaac 1200 
ccucucccuc cccccccccu aacguuacug gccgaagccg cuuggaauaa ggccggugug 1260 
cguuugucua uauguuauuu uccaccauau ugccgucuuu uggcaaugug agggcccgga 1320 
aaccuggccc ugucuucuug acgagcauuc cuaggggucu uuccccucuc gccaaaggaa 1380 
ugcaaggucu guugaauguc gugaaggaag caguuccucu ggaagcuucu ugaagacaaa 1440 
caacgucugu agcgacccuu ugcaggcagc ggaacccccc accuggcgac aggugccucu 1500 
gcggccaaaa gccacgugua uaagauacac cugcaaaggc ggcacaaccc cagugccacg 1560 
uugugaguug gauaguugug gaaagaguca aauggcucuc cucaagcgua uucaacaagg 1620 
ggcugaagga ugcccagaag guaccccauu guaugggauc ugaucugggg ccucggugca 1680 
caugcuuuac auguguuuag ucgagguuaa aaaaacgucu aggccccccg aaccacgggg 1740 
acgugguuuu ccuuugaaaa acacgaugau accauggcuc ccaucacugc uuaugcccag 1800 
caaacacgag gccuccuggg cgccauagug gugaguauga cggggcguga caggacagaa 1860 
caggccgggg aaguccaaau ccuguccaca gucucucagu ccuuccucgg aacaaccauc 1920 
ucggggguuu uguggacugu uuaccacgga gcuggcaaca agacucuagc cggcuuacgg 1980 
gguccgguca cgcagaugua cucgagugcu gagggggacu ugguaggcug gcccagcccc 2040 
ccugggacca agucuuugga gccgugcaag uguggagccg ucgaccuaua ucuggucacg 2100 
cggaacgcug augucauccc ggcucggaga cgcggggaca agcggggagc auugcucucc 2160 
ccgagaccca uuucgaccuu gaaggggucc ucgggggggc cggugcucug cccuaggggc 2220 
cacgucguug ggcucuuccg agcagcugug ugcucucggg gcguggccaa auccaucgau 2280 
uucauccccg uugagacacu cgacguuguu acaaggucuc ccacuuucag ugacaacagc 2340 
acgccaccgg cugugcccca gaccuaucag gucggguacu ugcaugcucc aacuggcagu 2400 
ggaaagagca ccaagguccc ugucgcguau gccgcccagg gguacaaagu acuagugcuu 2460 
aaccccucgg uagcugccac ccugggguuu ggggcguacc uauccaaggc acauggcauc 2520 
aaucccaaca uuaggacugg agucaggacc gugaugaccg gggaggccau cacguacucc 2580 
acauauggca aauuucucgc cgaugggggc ugcgcuagcg gcgccuauga caucaucaua 2640 
ugcgaugaau gccacgcugu ggaugcuacc uccauucucg gcaucggaac gguccuugau 2700 
caagcagaga cagccggggu cagacuaacu gugcuggcua cggccacacc ccccggguca 2760 
gugacaaccc cccaucccga uauagaagag guaggccucg ggcgggaggg ugagaucccc 2820 
uucuauggga gggcgauucc ccuauccugc aucaagggag ggagacaccu gauuuucugc 2880 
cacucaaaga aaaaguguga cgagcucgcg gcggcccuuc ggggcauggg cuugaaugcc 2940 
guggcauacu auagaggguu ggacgucucc auaauaccag cucagggaga uguggugguc 3000 
gucgccaccg acgcccucau gacgggguac acuggagacu uugacuccgu gaucgacugc 3060 
aauguagcgg ucacccaagc ugucgacuuc agccuggacc ccaccuucac uauaaccaca 3120 
cagacugucc cacaagacgc ugucucacgc agucagcgcc gcgggcgcac agguagagga 3180 
agacagggca cuuauaggua uguuuccacu ggugaacgag ccucaggaau guuugacagu 3240 
guagugcuuu gugagugcua cgacgcaggg gcugcguggu acgaucucac accagcggag 3300 
accaccguca ggcuuagagc guauuucaac acgcccggcc uacccgugug ucaagaccau 3360 
cuugaauuuu gggaggcagu uuucaccggc cucacacaca uagacgccca cuuccucucc 3420 
caaacaaagc aagcggggga gaacuucgcg uaccuaguag ccuaccaagc uacggugugc 3480 
gccagagcca aggccccucc cccguccugg gacgccaugu ggaagugccu ggcccgacuc 3540 
aagccuacgc uugcgggccc cacaccucuc cuguaccguu ugggcccuau uaccaaugag 3600 
gucacccuca cacacccugg gacgaaguac aucgccacau gcaugcaagc ugaccuugag 3660 
gucaugacca gcacgugggu ccuagcugga ggaguccugg cagccgucgc cgcauauugc 3720 
cuggcgacug gaugcguuuc caucaucggc cgcuugcacg ucaaccagcg agucgucguu 3780 
gcgccggaua aggagguccu guaugaggcu uuugaugaga uggaggaaug cgccucuagg 3840 
gcggcucuca ucgaagaggg gcagcggaua gccgagaugu ugaaguccaa gauccaaggc 3900 
uugcugcagc aggccucuaa gcaggcccag gacauacaac ccgcuaugca ggcuucaugg 3960 
cccaaagugg aacaauuuug ggccagacac auguggaacu ucauuagcgg cauccaauac 4020 
cucgcaggau ugucaacacu gccagggaac cccgcggugg cuuccaugau ggcauucagu 4080 
gccgcccuca ccaguccguu gucgaccagu accaccaucc uucucaacau caugggaggc 4140 
ugguuagcgu cccagaucgc accacccgcg ggggccaccg gcuuugucgu caguggccug 4200 
gugggggcug ccgugggcag cauaggccug gguaaggugc ugguggacau ccuggcagga 4260 
uauggugcgg gcauuucggg ggcccucguc gcauucaaga ucaugucugg cgagaagccc 4320 
ucuauggaag augucaucaa ucuacugccu gggauccugu cuccgggagc ccugguggug 4380 
ggggucaucu gcgcggccau ucugcgccgc cacgugggac cgggggaggg cgcgguccaa 4440 
uggaugaaca ggcuuauugc cuuugcuucc agaggaaacc acgucgcccc uacucacuac 4500 
gugacggagu cggaugcguc gcagcgugug acccaacuac uuggcucucu uacuauaacc 4560 
agccuacuca gaagacucca caauuggaua acugaggacu gccccauccc augcuccgga 4620 
uccuggcucc gcgacgugug ggacuggguu ugcaccaucu ugacagacuu caaaaauugg 4680 
cugaccucua aauuguuccc caagcugccc ggccuccccu ucaucucuug ucaaaagggg 4740 
uacaagggug ugugggccgg cacuggcauc augaccacgc gcugcccuug cggcgccaac 4800 
aucucuggca auguccgccu gggcucuaug aggaucacag ggccuaaaac cugcaugaac 4860 
accuggcagg ggaccuuucc uaucaauugc uacacggagg gccagugcgc gccgaaaccc 4920 
cccacgaacu acaagaccgc caucuggagg guggcggccu cggaguacgc ggaggugacg 4980 
cagcaugggu cguacuccua uguaacagga cugaccacug acaaucugaa aauuccuugc 5040 
caacuaccuu cuccagaguu uuucuccugg guggacggug ugcagaucca uagguuugca 5100 
cccacaccaa agccguuuuu ccgggaugag gucucguucu gcguugggcu uaauuccuau 5160 
gcugucgggu cccagcuucc cugugaaccu gagcccgacg cagacguauu gagguccaug 5220 
cuaacagauc cgccccacau cacggcggag acugcggcgc ggcgcuuggc acggggauca 5280 
ccuccaucug aggcgagcuc cucagugagc cagcuaucag caccgucgcu gcgggccacc 5340 
ugcaccaccc acagcaacac cuaugacgug gacauggucg augccaaccu gcucauggag 5400 
ggcggugugg cucagacaga gccugagucc agggugcccg uucuggacuu ucucgagcca 5460 
auggccgagg aagagagcga ccuugagccc ucaauaccau cggagugcau gcuccccagg 5520 
agcggguuuc cacgggccuu accggcuugg gcacggccug acuacaaccc gccgcucgug 5580 
gaaucgugga ggaggccaga uuaccaaccg cccaccguug cugguugugc ucuccccccc 5640 
cccaagaagg ccccgacgcc ucccccaagg agacgccgga cagugggucu gagcgagagc 5700 
accauaucag aagcccucca gcaacuggcc aucaagaccu uuggccagcc ccccucgagc 5760 
ggugaugcag gcucguccac gggggcgggc gccgccgaau ccggcggucc gacguccccu 5820 
ggugagccgg cccccucaga gacagguucc gccuccucua ugcccccccu cgagggggag 5880 
ccuggagauc cggaccugga gucugaucag guagagcuuc aaccuccccc ccaggggggg 5940 
gggguagcuc ccgguucggg cucggggucu uggucuacuu gcuccgagga ggacgauacc 6000 
accgugugcu gcuccauguc auacuccugg accggggcuc uaauaacucc cuguagcccc 6060 
gaagaggaaa aguugccaau caacccuuug aguaacucgc uguugcgaua ccauaacaag 6120 
guguacugua caacaucaaa gagcgccuca cagagggcua aaaagguaac uuuugacagg 6180 
acgcaagugc ucgacgccca uuaugacuca gucuuaaagg acaucaagcu agcggcuucc 6240 
aaggucagcg caaggcuccu caccuuggag gaggcgugcc aguugacucc accccauucu 6300 
gcaagaucca aguauggauu cggggccaag gagguacgca gcuuguccgg gagggccguu 6360 
aaccacauca aguccgugug gaaggaccuc cuggaagacc cacaaacacc aauucccaca 6420 
accaucaugg ccaaaaauga gguguucugc guggaccccg ccaagggggg uaagaaacca 6480 
gcucgccuca ucguuuaccc ugaccucggc guccgggucu gcgagaaaau ggcccucuau 6540 
gacauuacac aaaagcuucc ucaggcggua augggagcuu ccuauggcuu ccaguacucc 6600 
ccugcccaac ggguggagua ucucuugaaa gcaugggcgg aaaagaagga ccccaugggu 6660 
uuuucguaug auacccgaug cuucgacuca accgucacug agagagacau caggaccgag 6720 
gaguccauau accaggccug cucccugccc gaggaggccc gcacugccau acacucgcug 6780 
acugagagac uuuacguagg agggcccaug uucaacagca agggucaaac cugcgguuac 6840 
agacguugcc gcgccagcgg ggugcuaacc acuagcaugg guaacaccau cacaugcuau 6900 
gugaaagccc uagcggccug caaggcugcg gggauaguug cgcccacaau cucagaaagc 6960 
caggggacug aggaggacga gcggaaccug agagccuuca cggaggccau gaccagguac 7020 
ucugccccuc cuggugaucc ccccagaccg gaauaugacc uggagcuaau aacauccugu 7080 
uccucaaaug ugucuguggc guugggcccg cggggccgcc gcagauacua ccugaccaga 7140 
gacccaacca cuccacucgc ccgggcugcc ugggaaacag uuagacacuc cccuaucaau 7200 
ucauggcugg gaaacaucau ccaguaugcu ccaaccauau ggguucgcau gguccuaaug 7260 
acacacuucu ucuccauucu caugguccaa gacacccugg accagaaccu caacuuugag 7320 
auguauggau caguauacuc cgugaauccu uuggaccuuc cagccauaau ugagagguua 7380 
cacgggcuug acgccuuuuc uaugcacaca uacucucacc acgaacugac gcggguggcu 7440 
ucagcccuca gaaaacuugg ggcgccaccc cucagggugu ggaagagucg ggcucgcgca 7500 
gucagggcgu cccucaucuc ccguggaggg aaagcggccg uuugcggccg auaucucuuc 7560 
aauugggcgg ugaagaccaa gcucaaacuc acuccauugc cggaggcgcg ccuacuggac 7620 
uuauccaguu gguucaccgu cggcgccggc gggggcgaca uuuuucacag cgugucgcgc 7680 
gcccgacccc gcucauuacu cuucggccua cuccuacuuu ucguaggggu aggccucuuc 7740 
cuacuccccg cucgguagag cggcacacac uagguacacu ccauagcuaa cuguuccuuu 7800 
uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu uuuuuuuuuu cuuuuuuuuu uuuuucccuc 7860 
uuucuucccu ucucaucuua uucuacuuuc uuucuuggug gcuccaucuu agcccuaguc 7920 
acggcuagcu gugaaagguc cgugagccgc augacugcag agagugccgu aacuggucuc 7980 
ucugcagauc augu 7994 



<210> 9 
<211> 340 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA 

<400> 9 

accugccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgccua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 

cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 

aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 

caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 

cgcuugcgag ugccccggga ggucucguag accgugcacc 340 



<210> 10 
<211> 340 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA 
<400> 10 

acccgccccu aauaggggcg acacuccgcc augaaucacu ccccugugag gaacuacugu 60 
cuucacgcag aaagcgucua gccauggcgu uaguaugagu gucguacagc cuccaggccc 120 
cccccucccg ggagagccau aguggucugc ggaaccggug aguacaccgg aauugccggg 180 
aagacugggu ccuuucuugg auaaacccac ucuaugcccg gccauuuggg cgugcccccg 240 
caagacugcu agccgaguag cguuggguug cgaaaggccu ugugguacug ccugauaggg 300 
ugcuugcgag ugccccggga ggucucguag accgugcacc 340 



<210> 11 

<211> 236 

<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic RNA 
<400> 11 

agcggcacac acuagguaca cuccauagcu aacuguuccu uuuuuuuuuu uuuuuuuuuu 60 
uuuuuuuuuu uuuuuuuuuu uucuuuuuuu uuuuuuuccc ucuuucuucc cuucucaucu 120 
uauucuacuu ucuuucuugg uggcuccauc uuagcccuag ucacggcuag cugugaaagg 180 
uccgugagcc gcaugacugc agagagugcc guaacugguc ucucugcaga ucaugu 236 



<210> 12 

<211> 232 

<212> RNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic RNA 

<400> 12 

agcggcacac auuagcuaca cuccauagcu aacuguuccu uuuuuuuuuu uuuuuuuuuu 60 
uuuuuuuuuu uuuuuuucuu uuuuuuuuuu uuucccucuu ucuucccuuc ucaucuuauu 120 
cuacuuucuu ucuugguggc uccaucuuag cccuggucac ggcuagcugu gaaagguccg 180 
ugagccgcau gacugcagag agugccguaa cuggucucuc ugcagaucau gu 232 



<210> 13 

<211> 17 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 13 

cgggagagcc atagtgg 17 

<210> 14 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 14 

agtaccacaa ggcctttcg 19 



<210> 15 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 15 

ctgcggaacc ggtgagtaca c 21 

<210> 16 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 16 

aacaagatgg attgcacgca 20 

<210> 17 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 17 

cgtcaagaag gcgatagaag 20 

<210> 18 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 18 

gcactctctg cagtcatgcg gctcacggac 30 

<210> 19 
<211> 28 
<212> DNA 
<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 19 

cccctgtgag gaactactgt cttcacgc 28 

<210> 20 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 20 

ccgggagagc catagtggtc tgcg 24 

<210> 21 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 21 

ccactcaaag aaaaagtgtg acgagctcgc 30 

<210> 22 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 22 

ggcttgggca cggcctga 18 

<210> 23 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 23 

gcggtgaaga ccaagctcaa actcactcca 30 

<210> 24 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 24 

agaacctgcg tgcaatccat c 21 

<210> 25 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 25 

cccgtcatga gggcgtcggt ggc 23 

<210> 26 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 26 

accagcaacg gtgggcggtt ggtaatc 27 

<210> 27 
<211> 18 
<212> DNA 
<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 27 

ggcacgcgac acgctgtg 18 



<210> 28 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 28 

agctagccgt gactagggct aagatggagc 30 

[Sequence Listing Free Text] 

SEQ ID NOS: 1, 2, 7 and 8 set forth the sequences of replicons. 
SEQ ID NOS: 9 to 12 set forth the sequences of synthetic RNAs. 
SEQ ID NOS: 13 to 28 set forth the sequences of synthetic DNAs. 
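
As an illustrative aid only, and not part of the sequence listing, the following minimal Python sketch shows one way to check the residues printed under a <400> identifier against the length declared under <211>. The helper names `residues` and `declared_length_matches` are hypothetical and are introduced here for illustration; the sketch assumes lower-case nucleotide lines in groups of ten, optionally followed by a position counter.

```python
import re


def residues(line: str) -> str:
    """Return only the residue letters on one listing line.

    Spaces between the 10-residue groups and the trailing position
    counter (e.g. "60") are dropped because they are not letters.
    """
    return "".join(re.findall(r"[a-z]+", line.lower()))


def declared_length_matches(lines, declared):
    """Compare the counted residues with the <211> value.

    Returns a tuple (matches, counted).
    """
    counted = sum(len(residues(line)) for line in lines)
    return counted == declared, counted


# SEQ ID NO: 13 is declared as 17 nucleotides (<211> 17).
print(declared_length_matches(["cgggagagcc atagtgg"], 17))
```

A count that disagrees with the declared length indicates that a line of the listing was damaged or lost, although a matching count does not by itself show that every residue is correct.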
[Brief Description of Drawings] 
[Fig. 1] 

Fig. 1 is a schematic view showing procedures for constructing a template DNA 

for preparing the HCV-RNA replicon according to the present invention. The 
upper section of Fig. 1 shows the structure of the region within pJFH1 or pJCH1 
into which the viral genome has been inserted. The lower section of Fig. 1 shows the 
structure of the corresponding region within plasmid DNA pSGREP-JFH1 or pSGREP-JCH1, 
which was constructed by substituting a 
part of the viral genome-inserted region of pJFH1 or pJCH1 with a DNA fragment 
containing a neomycin resistance gene and EMCV IRES. Symbols in Fig. 1 are 
as described below. T7, T7 RNA promoter; G, dGTP that was inserted upstream 
of the 5' end of the inserted DNA derived from JFH-1 or JCH-1 and downstream of 
the 3' end of T7 RNA promoter sequence; 5' NTR, 5' untranslated region; Core, 
core protein; and 3' NTR, 3' untranslated region. E1 and E2 represent envelope 
proteins. NS2, NS3, NS4A, NS4B, NS5A and NS5B represent non-structural 
proteins. Age I, Cla I and Xba I represent cleavage sites of restriction enzymes 
Age I, Cla I and Xba I, respectively. GDD, the position of amino acid motif GDD 
corresponding to the active center of NS5B protein; neo, neomycin resistance 
gene; and EMCV IRES, internal ribosome entry site of encephalomyocarditis virus 
(EMCV). 

[Fig. 2A] 

Fig. 2 shows the nucleotide sequence of rSGREP-JFH1. 

[Fig. 2B] 

Fig. 2 shows the nucleotide sequence of rSGREP-JFH1. 
[Fig. 2C] 

Fig. 2 shows the nucleotide sequence of rSGREP-JFH1. 
[Fig. 2D] 

Fig. 2 shows the nucleotide sequence of rSGREP-JFH1. 
[Fig. 2E] 

Fig. 2 shows the nucleotide sequence of rSGREP-JFH1. 

[Fig. 2F] 

Fig. 2 shows the nucleotide sequence of rSGREP-JFH1. 
[Fig. 3A] 

Fig. 3 shows the nucleotide sequence of rSGREP-JCH1. 
[Fig. 3B] 

Fig. 3 shows the nucleotide sequence of rSGREP-JCH1. 
[Fig. 3C] 

Fig. 3 shows the nucleotide sequence of rSGREP-JCH1. 
[Fig. 3D] 

Fig. 3 shows the nucleotide sequence of rSGREP-JCH1. 
[Fig. 3E] 

Fig. 3 shows the nucleotide sequence of rSGREP-JCH1. 
[Fig. 3F] 

Fig. 3 shows the nucleotide sequence of rSGREP-JCH1. 

[Fig. 4] 

Fig. 4 shows photographs showing the colony formation of Huh7 cells into which 
rSGREP-JFH1, rSGREP-JFH1/GND and rSGREP-JFH1/dGDD were transfected, 
respectively. The amount of each of the three transfected RNAs in the upper section 
was 100 ng and that of the three transfected RNAs in the lower section was 300 ng. 
[Fig. 5] 

Fig. 5 shows photographs showing colony formation of Huh7 cells into which 
rSGREP-JFH1 and rSGREP-JCH1, respectively, had been transfected when the 
concentration of G418 was 0.5 mg/ml of the medium. The amount of each of 
these RNAs transfected was 100 ng. 
[Fig. 6] 

Fig. 6 shows photographs showing the effect of Mung Bean Nuclease treatment 
conducted on the colony-forming ability of the transfected cells. The amount of 
rSGREP-JFH1 RNA transfected was 100 ng for both cases. The concentration of 
G418 was 1.0 mg/ml in both media. 

[Fig. 7] 

Fig. 7 shows photographs showing colony formation when total cellular RNA 
derived from the replicon-replicating cell clone, which had been established by 
transfection of rSGREP-JFH1, was retransfected into other Huh7 cells. The 
photograph on the left shows that the formation of 96 colonies was observed 
when using the total cellular RNA derived from the replicon-replicating cell 
clone No. 6. The photograph on the right shows that the formation of 77 colonies 
was observed when using the total cellular RNA derived from the pool 
clones. In both cases, RNA was retransfected in an amount containing IxlO' 
copies of the replicon RNA. 
[Fig. 8] 

Fig. 8 shows photographs showing the results of Northern blot detection, using an
rSGREP-JFH1-specific probe, of the total RNA derived from cell clones that had
been obtained by retransfecting the total cellular RNA (derived from the
replicon-replicating cell clone established by transfection of rSGREP-JFH1) into
other Huh7 cells. Explanation of the lanes is as follows. 10^ represents a
sample prepared by adding 10* copies of the replicon RNA synthesized in vitro to
total RNA extracted from Huh7 cells. 10' represents a sample prepared by adding
10' copies of the replicon RNA synthesized in vitro to total RNA extracted from
Huh7 cells. Huh7, total RNA extracted from untransfected Huh7 cells; pool clone,
total RNA extracted from the pool clones; and 1-11, total RNA extracted from each
of cell clones Nos. 1 to 11. "Replicon RNA" represents the electrophoresed
position of a molecular weight marker indicating the size of rSGREP-JFH1, "28S"
represents the position of a ribosomal RNA marker corresponding to a size of
4.5 kb, and "18S" represents the position of a ribosomal RNA marker
corresponding to a size of 1.9 kb.
[Fig. 9] 

Fig. 9 shows photographs showing the presence or absence of incorporation of a
neomycin resistance gene into the genomic DNA of the host cell in the cell clones
into which rSGREP-JFH1- or rSGREP-JCH1-derived replicated replicon RNA was
retransfected. Explanation of the lanes in the photograph on the left is as
follows. M, DNA molecular weight marker; 1-8, rSGREP-JFH1-derived cell clones
Nos. 1 to 8; N, untransfected Huh7 cells; and P, positive control (PCR
amplification product of the neomycin resistance gene). Furthermore, explanation
of the lanes in the photograph on the right is as follows. M, DNA molecular
weight marker; and 1-6, rSGREP-JCH1-derived cell clones Nos. 1 to 6.

[Fig. 10] 

Fig. 10 shows photographs showing the results of detecting NS3 protein expressed
in the cell clones that were retransfected with rSGREP-JFH1- or
rSGREP-JCH1-derived replicated replicon RNA. Lanes 1 to 8 of the photograph on
the left represent rSGREP-JFH1-derived cell clones Nos. 1 to 8. Lanes 1 to 6 of
the photograph on the right represent rSGREP-JCH1-derived cell clones Nos. 1 to 6.
Lane P of the photograph on the right represents NS3 protein (positive control)
and N represents protein extracted from untransfected Huh7 cells (negative
control).

[Fig. 11] 

Fig. 11 shows the positions of nucleotide mutations in replicon RNAs obtained
from 21 cell clones that were established through the re-transfection of
rSGREP-JFH1-derived replicated replicon RNA into Huh7 cells. Mutation positions
are indicated using bar lines shown with nucleotide numbers listed in Table 4. A
thick bar line denotes nonsynonymous substitution and a thin bar line denotes
synonymous substitution.
[Title of Document] Drawings
[Figure 1]
[Figure 2A]
[Figure 2B]
[Figure 2C]
[Figure 2D]
[Figure 2E]
[Figure 2F]
[Figure 3A]
[Figure 3B]
[Figure 3D]
[Figure 3E]
[Figure 3F]

[Nucleotide sequence panel, approximately nucleotides 6900 to 8040]

JP Application No. 2003-148242

[Figure 4] 
[Figure 5]
[Figure 6]
[Figure 7]
[Figure 8] 
[Figure 9]
[Figure 10]
[Figure 11]
[Title of Document] ABSTRACT
[Abstract] 

[Technical Problem] An object is to provide a replicon RNA that is derived from
HCV of a different genotype from genotype 1b.

[Technical Solution] A replicon RNA comprising a nucleotide sequence at least 
containing the 5' untranslated region, the sequence encoding NS3 protein, NS4A 
protein, NS4B protein, NS5A protein and NS5B protein, and the 3' untranslated 
region on the genomic RNA of hepatitis C virus of genotype 2a is provided. 
[Selected Drawing] None 